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Am386 DE 


High Performance, Low Power, 32-Bit 
Embedded Microprocessor 


DISTINCTIVE CHARACTERISTICS 
@ Member of Am386 E CPU series 

— Am386 microprocessor core 

— Designed for embedded applications 

— Socket compatible with 386DX processor 


@ Industry standard architecture allows use of 
existing peripheral support chips, development 
tools, and application software 


@ Ideal for embedded applications 
— Low power consumption 

— 3-5 V operation 

— Fully static operation 

B@ High performance 

— 33-MHz operating frequency 

— 32-bit architecture 


- 30-bit address bus 
- 32-bit data bus 


— Four levels of hardware-enforced protection 


cl 


Advanced 
Micro 
Devices 


& Optimized for the cost-sensitive embedded 
marketplace 


— Real and protected mode operation without paging 
— Full segmentation unit and descriptor table support 


@ Supports world’s largest software base for x86 
architectures 


@ Compatible with Microsoft at Work™ and 
Novell NEST embedded software 


B® Coprocessor interface; supports 
387DX-compatibie math coprocessor 


@ Advanced packaging options 

— 132-Pin Plastic Quad Flat Pack 

@ Extended temperature versions avallable 

—@ Based on AMD advanced CMOS technology 
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FUNCTIONAL DESCRIPTION 


True Static Operation 


The Am386DE microprocessor incorporates a true 
static design. Unlike dynamic circuit design, the 
Am386DE device eliminates the minimum operating fre- 
quency restriction. It may be clocked from its maximum 
speed of 33 MHz all the way down to 0 MHz (DC). Sys- 
tem designers can use this feature to design energy- 
efficient embedded control devices. 


Standby Mode 


This true static design allows for a standby mode. At any 
of its operating speeds (33 MHz to 0 MHz), the 
Am386DE microprocessor will retain its state {i.e., the 
contents of ail of its registers). By shutting off the clock 
completely, the device enters standby mode. Since 
power consumption is a function of clock frequency, 
operating power consumption is reduced as the fre- 
quency is lowered. In standby mode, typical current 
draw is reduced to less than 20 pA at DC. 


Not only does this feature save battery life, but it also 
simplifies designs in the following ways: 


1. Eliminates the need for software in BIOS to save 
and restore the contents of registers. 


2. Allows simpler circuitry to control stopping of the 
clock since the system does not need to know the 
processor state. 


Lower Operating lcc 


True static design also allows lower operating I,. when 
operating at any speed. 


Performance On Demand 


The Am386DE microprocessor retains its state at any 
speed from 0 MHz (DC) to its maximum operating speed 
of 33 MHz. With this feature, system designers may vary 
the operating speed of the system to extend the battery 
life in portable systems, and reduce power and cooling 
requirements for embedded products. 


For example, the system could operate at low speeds 
during inactivity or polling operations. However, upon 
interrupt, the system clock can be increased up to its 
maximum speed. After a user-defined time-out period, 
the system can be returned to a low (or 0 MHz) operating 
speed without losing its state. This design maximizes 
life while achieving optimal performance. 
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CONNECTION DIAGRAMS 
132-Lead Plastic Quad Flat Pack (PQFP) Package—Top Side View 


OONOMS WN = 


Notes: 

Pin 1 is marked for orientation. 

NC = Not connected; Connection of an NC pin may cause a malfunction or incompatibility 
with future versions of the Am386DE microprocessor. 
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CONNECTION DIAGRAMS (continued) 
132-Lead Plastic Quad Flat Pack (PQFP)—Pin Side View 


AN WEAMRUNDO 


Notes: 

Pin 1 is marked for orientation. 

NC = Not connected; connection of an NC pin may cause a maltunction or incompatibility 
with future versions of the Am386DE microprocessor. 
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CONNECTION DIAGRAMS (continued) 
PQFP Pin Designations (sorted by Functional Grouping) 


Pin Pin Pin Pin 
Name No. Name No. 
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CONNECTION DIAGRAMS (continued) 
PQFP Pin Designations (sorted by Pin Number) 


Pin Pin Pin Pin Pin Pin Pin Pin Pin Pin Pin Pin 
No. Name No. Name | No. Name | No. Name | No. Name | No. Name 
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LOGIC SYMBOL 


2X Clock 


Data Bus 
Address 
Bus Float 
Interrupt 
Control 
Bus 
Cycle 
Control 
Math 
Coprocessor 
Control 
Bus 
Cycle 
Definition 


HLDA 


Bus Arbitration 
Control 
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ORDERING INFORMATION 
Standard Products 


AMD standard products are available in several packages and operating ranges. The order number (Valid Combination) is formed 
by a combination of the elements below. 


Am386DE 


-33 K Cc 


TEMPERATURE RANGE 
C = Commercial (Toase = 0°C to +100°C) 
| = Extended Temperature (Tcase = —40°C to 100°C) 


PACKAGE TYPE 
K = 132-Pin Plastic Quad Flat Pack 


SPEED OPTION 
-33 = 33 MHz 


DEVICE NUMBER/DESCRIPTION 
Am386DE 
Am386DE High-Performance 32-Bit Embedded Microprocessor 


Valid Combinations ' : ¥aed Comminatians 
Valid Combinations lists configurations planned to 
be supported in volume for this device. All speeds 
Am386DE ~83KC may not be available in all package combinations. 
Consutt the local AMD sales office to confirm 


availability of specific valid combinations and to 
Am386DE ~33K\I check on newly released combinations. 
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PIN DESCRIPTION 

A31-A2 

Address Bus (Outputs) 

Outputs physical memory or port I/O addresses. 


ADS 
Address Status (Active Low; Output) 


Indicates that a valid bus cycle definition and address 


are being driven at the AmM386DE microprocessor pins. 


BE3—BE0 
Byte Enables (Active Low; Outputs) 


Indicate which data bytes of the data bus take partina 
bus cycle. 


BS16 
Bus Size 16 (Active Low; input) 
Allows direct connection of 32-bit and 16-bit data buses. 


BUSY 
Busy (Active Low; Input) 


Signals a busy condition from a processor extension. 


CLK2 
Clock (Input) 


Provides the fundamental timing for the Am386DE 
microprocessor. 


D31—D0 
Data Bus (Inputs/Outputs) 


Inputs data during memory, I/O, and interrupt acknowl- 
edge read cycles and outputs data during memory and 
V/O write cycles. 


D/C 
Data/Control (Output) 


Abus cycle definition pin that distinguishes data cycles, 
either memory or I/O, from control cycles which are: 
interrupt acknowledge, halt, and instruction fetching. 


ERROR 

Error (Active Low; Input) 

Signals an error condition from a processor extension. 
FLT 

Float (Active Low; Input) 

An input signal which forces all bidirectional and output 
signals, including HLDA, to the three-state condition. 


FLT has an internal pull-up resistor, and if it is not used it 
should be unconnected. 
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HLDA 
Bus Hold Acknowledge (Active High; Output) 


Indicates that the Am386DE microprocessor has sur- 
rendered control of its local bus to another bus master. 


HOLD 
Bus Hold Request (Active High; Input) 


Allows another bus master to request control of the 
local bus. 


INTR 
interrupt Request (Active High; Input) 


A maskable input that signals the AmM386DE micropro- 
cessor to suspend execution of the current program and 
execute an interrupt acknowledge function. 


T6cK 
Bus Lock (Active Low; Output) 


Abus cycle definition pin that indicates that other system 
bus masters are denied access to the system bus while 
it is active. 

M16 

Memory I/O (Output) 

A bus cycle definition pin that distinguishes memory 
cycles from input/output cycles. 

NA 

Next Address (Active Low; Input) 

Used to request address pipelining. 


NC 
No Connect 


Should aways remain unconnected. Connection of an 
NC pin may cause the processor to malfunction or be 
incompatible with future steppings of the Am386DE 
microprocessor. 


NMI 
Non-Maskable Interrupt Request 
(Active High; Input) 


Anon-maskable input that signals the AM386DE micro- 
processor to suspend execution of the current program 
and execute an interrupt acknowledge function. 


PEREQ 
Processor Extension Request (Active High; Input) 


Indicates that the processor extension has data to be 
transferred by the AmM386DE microprocessor. 


READ 
Bus Ready (Active Low; Input) 


Terminates the bus cycle. 
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RESET 
Reset (Active High; Input) 
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Vss 
System Ground (Input) 


Suspends any operation in progress and places the Provides 0-V connection from which all inputs and out- 
Am386DE microprocessor in a known reset state. puts are measured. 


Vee 
System Power (Input) 
Provides 3-5-V nominal DC supply input. 


WR 
Write/Read (Output) 


Abus cycle definition pin that distinguishes write cycles 
from read cycles. 
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BASE ARCHITECTURE 


Introduction 


The Am386DE microprocessor consists of a central pro- 
cessing unit, a memory management unit, and a bus 
interface. 


The central processing unit consists of the execution 
unit and instruction unit. The execution unit contains the 
eight 32-bit general purpose registers that are used for 
both address calculation, data operations, and a 64-bit 
barrel shitter used to speed shift, rotate, multiply, and 
divide operations. The multiply and divide logic uses a 
1-bit per cycle algorithm. The multtiply algorithm stops 
the iteration when the most significant bits of the multi- 
plier are all zero. This allows typical 32-bit multiplies to 
be executed in under 1 ms. The instruction unit decodes 
the instruction op-codes and stores them in the 
decoded instruction queue for immediate use by the 
execution unit. 


Memory is organized into one or more variable length 
segments, each up to 4 Gbytes in size. Agiven region of 
the physical address space, a segment, can have attrib- 
utes associated with it. These attributes include its loca- 
tion, size, type (i.e., stack, code, or data), and protection 
characteristics. The Am386DE microprocessor can 
have a maximum of 16,381 segments of up to 4 Gbytes 
each. 


The segmentation unit provides four levels of protection 
for isolating and protecting applications and the operat- 
ing system from each other. The hardware enforced 
protection allows the design of system with a high 
degree of integrity. 


The Am386DE microprocessor has two modes of 
operation: Real Address Mode (Real Mode) and Pro- 
tected Address Mode (Protected Mode). In Real Mode, 
the Am386DE device operates as a very fast 8086 but 
with 32-bit extensions, if desired. Real Mode is required 
primarily to setup the processor for Protected Mode 
operation. Protected Mode provides an address to the 
sophisticated memory management and privilege capa- 
bilities of the processor. 


Finally, to facilitate high-performance system hardware 
designs, the Am386DE microprocessor bus interface 
offers address pipelining, dynamic data bus sizing, and 
direct Byte Enable signals for each byte of the data bus. 
These hardware features are described fully beginning 
in the Functional Data section. 

Register Overview 


The Am386DE microprocessor has 28 register 
resources in the following categories. 

« General Purpose Registers 

« Segment Registers 

e Instruction Pointer and Flags 

¢ Control Register 
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e System Address Registers 
e¢ Debug Registers 


The registers are a superset of the 8086, 80186, and 
80286 registers, so all 16-bit 80186 and 80286 registers 
are contained within the 32-bit Am386DE microprocessor. 


Figure 1 shows all the Am386DE microprocessor base 
architecture registers that include the general address 
and data registers, the instruction pointer, and the flags 
register. The contents of these registers are task-spe- 
Citic, so these registers are automatically loaded with a 
new context upon a task switch operation. 


The base architecture also includes six directly accessi- 
ble segments, each up to 4 Gbytes in size. The seg- 
ments are indicated by the selector values placed in 
Am386DE CPU segment registers of Figure 1. Various 
selector values can be loaded as a program executes, if 
desired. 


The selectors are also task specific, so the segment reg- 
isters are automatically loaded with new context upon a 
task switch operation. 


The other types of registers Control, System Address, 
and Debug are primarily used by system software. 


General Data and Address Registers 
31 16 15 0 


EAX 
EBX 
ECX 
EDX 
ESI 

EDI 

EBP 
ESP 


Segment Selector Registers 
15 0 


CS Code 
ss Stack 


DS 
ES 
FS 
GS 


Data 


Instruction Pointer and Flags Registers 
31 16 15 0 


ae ae ee 
| | FLAGS | EFLAGS 


18421A-003 
Figure 1. Base Architecture Registers 
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Register Descriptions 


General-Purpose Registers 


The eight general-purpose registers of 32 bits hold data 
or address quantities. The general registers, Figure 2, 
support data operands of 1, 8, 16, 32, and 64 bits and bit 
fields of 1 to 32 bits. They support address operands of 
16 and 32 bits. The 32-bit registers are named EAX, 
EBX, ECX, EDX, ES!, EDI, EBP, and ESP. 


The least significant 16 bits of the registers can be 
accessed separately. This is done by using the 16-bit 
names of the registers AX, BX, CX, DX, SI, Di, BP, and 
SP. When accessed as a 16-bit operand, the upper 16 
bits of the register are neither used nor changed. 


Finally, 8-bit operations can individually access the 
lower byte (bits 7-0) and the higher byte (bits 15~8) of 
general purpose registers AX, BX, CX, and DX. The 
lower bytes are named AL, BL, CL, and DL, respectively. 
The higher bytes are named AH, BH, CH, and DH, 
respectively. The individual byte accessibility offers 
additional flexibility for data operations, but is not used 
for effective address calculation. 


Instruction Pointer 


The instruction pointer, Figure 2, is a 32-bit register 
named EIP. EIP holds the offset of the next instruction to 
be executed. The offset is always relative to the base of 
the code segment (CS). The lower 16 bits (bits 15-0) of 
EIP contain the 16-bit instruction pointer named IP, 
which is used by 16-bit addressing. 


Flags Register 


The Flags Register is a 32-bit register named EFLAGS. 
The defined bits and bit fields within EFLAGS, shown in 
Figure 3, contro! certain operations and indicate status 
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Figure 2. Genera! Registers 
and Instruction Pointer 


of the Am386DE microprocessor. The lower 16 bits (bits 


15-0) of EFLAGS contain the 16-bit flag register named 
FLAGS, which is most useful when executing 8086 and 
80286 code. 


Note in the following descriptions, set means set to 1 
and reset means reset to 0. 


RF (Resume Flag, bit 16) 


The RF flag is used in conjunction with the debug 
register breakpoints. It is checked at instruction 
boundaries before breakpoint processing. When 
PF is set, it causes any debug fault to be ignored 
on the next instruction. RF is then automatically 
reset at the successful completion of every 
instruction (no fauks are signaled) except the 


| ‘ | eee. Carry Flag 
Resume Flag Parity Flag 
Nested Task Flag Auxiliary Carry 
I/O Privilege Level Zero Flag 
Overfiow Sign Flag 
Direction Flag Trap Flag 
Interrupt Enable 


Note: 


0 indicates “Reserved for Future Use.” Do not define; see the Compatibility section on page 17. 
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Figure 3. Flags Registers 
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(RET instruction and the POPF instruction. (JMP, 
CALL, and INT instructions causing a task switch.) 
These instructions set RF to the value specified by 
the memory image. For example, at the end of the 
breakpoint service routine, the IRET instruction 
can pop an EFLAGS image having the RF bit set 
and resume the program's execution at the break- 
point address without generating another break- 
point fault on the same location. 


NT (Nested Task, bit 14) 


This flag applies to Protected Mode. NT is set to 
indicate that the execution of this task is nested 
within another task. If set, it indicates that the cur- 
rent nested task’s Task State Segment (TSS) has 
a valid back link to the previous task's TSS. This bit 
is set or reset by control transfers to other tasks. 
The value of NT in EFLAGS is tested by the IRET 
instruction to determine whether to do an inter- 
task return or an intra-task return. A POPF or an 
IRET instruction will affect the setting of this bit 
according to the image popped at any privilege 
level. 


IOPL (Input/Output Privilege Level, bits 12~13) 


This two-bit field applies to Protected Mode. IOPL 
indicates the numerically maximum CPL (current 
privilege level) value permitted to execute '/O 
instructions without generating an Exception 13 
fault or consulting the I/O Permission Bitmap. It 
also indicates the maximum CPL value allowing 
alteration of the IF (INTR Enable Flag) bit when 
new values are popped into the EFLAGS register. 
POPF and IRET instruction can alter the JOPL 
field when executed at CPL = 0. Task switches 
can always alter the IOPL field when the new flag 
image is loaded from the incoming task’s TSS. 


OF (Overtiow Flag, bit 11) 


OD is set if the operation resulted in a signed over- 
flow. Signed overflow occurs when the operation 
resulted in carry/borrow into the sign bit (high-order 
bit) of the result but did not result in a carry/borrow 
out of the high-order bit or vice-versa. For 8-, 16-, 
and 32-bit operations, OF is set according to over- 
flow at bits 7, 15, and 31, respectively, 


DF (Direction Flag, bit 10) 


DF defines whether ESI and/or ED! registers post- 
decrement or postincrement during the sg instruc- 
tions. Postincrement occurs if DF is reset. 
Postdecrement occurs if DF is set. 


IF — (INTR Enable Flag, bit 9) 


The IF flag, when set, allows recognition of exter- 
nal interrupts signaled on the INTR pin. When IF is 
reset, external interrupts signaled onthe INTR are 
not recognized. !OPL indicates the maximum CPL 
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value allowing alteration of the IF bit when new 
values are popped into EFLAGS or FLAGS. 


TF (Trap Enable Fiag, bit 8) 


TF controls the generation of Exception 1 trap 
when single-stepping through code. When TF is 
set, the Am386DE microprocessor generates an 
Exception 1 trap after the next instruction is 
executed. When TF is reset, Exception 1 traps 
occur only as a function of the breakpoint 
addresses loaded into debug register DR3—-DRO. 


SF (Sign Flag, bit 7) 
SF is set if the high-order bit of the result is set; it 
is reset otherwise. For 8-, 16-, and 32-bit opera- 
tions, SF reflects the state of bits 7, 15, and 31, 
respectively. 


ZF (Zero Flag, bit 6) 


ZF is set if all bits of the result are 0. Otherwise it 
is reset. 


AF (Auxiliary Carry Flag, bit 4) 

The Auxiliary Flag is used to simplify the addition 
and subtraction of packed BCD quantities. AF is 
set if the operation resulted in a carry out of bit 3 
(addition) or a borrow into bit 3 (subtraction). 
Otherwise, AF is reset. AF is affected by carry out 
of, or borrow into, bit 3 only; regardless of overall 
operand length: 8, 16, or 32 bits. 


PF (Parity flags, bit 2) 
PF is set if the low-order 8 bits of the operation 
contain an even number of 1s (even parity). PF is 
reset if the low-order 8 bits have odd parity. PFisa 
function of only the low-order 8 bits, regardless of 
operand size. 


CF (Carry Flag, bit 0) 
CF is set if the operation resulted in a carry out of 
(addition) or a borrow into (subtraction) the high- 
order bit. Otherwise, CF is reset. For 8-, 16-, or 
32-bit operations, CF is set according to carry/bor- 
row at bits 7, 15, or 31, respectively. 


Segment Registers 


Six 16-bit segment registers hold segment selector 
values identifying the currently addressable memory 
segments. Segment registers are shown in Figure 4. 
In Protected Mode, each segment may range in size 
from one byte up to the entire linear and physical space 
of the machine, 4 Gb (2%2 bytes). If a maximum sized 
segment is used (limit = FFFFFFFFH), it should be 
Dword aligned (i.e., the least two significant bits of the 
segment base should be zero). This alignment will avoid 
a segment limit violation (Exception 13) caused by the 
wrap around. In Real Address Mode, the maximum 
segment size is fixed at 64 Kb (2'© bytes). 
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Segment 
Registers Descriptor Registers (Loaded Automatically) 
"oo 
Other 
_ Segment 
15 0 Physical Base Address Segment Limit Attributes from Descriptor 


Selector 


Figure 4. Segment Registers and Assoclated Descriptor Registers 


The six segments addressable at any given moment are 
detined by the segment registers: CS, SS, DS, ES, FS, 
and GS. The selector in SS indicates the current stack 
segment; the selectors in DS, ES, FS, and GS indicate 
the current data segments. 


Segment Descriptor Registers 


The segment descriptor registers are not programmer 
visible, yet it is very useful to understand their content. 
Inside the Am386DE microprocessor, a descriptor 
register (programmer invisible) is associated with 
each programmer-visible segment register, as shown 
by Figure 4. Each descriptor register holds a 32-bit seg- 
ment base address, a 32-bit segment limit, andthe other 
necessary segment attributes. 


When a selector value is loaded into a segment register, 
the associated descriptor register is automatically 
updated with the correct information. In Real Address 
Mode, only the base address is updated directly (by 
shifting the selector value four bits to the left), since the 
segment maximum limit and attributes are fixed in Real 
Mode. In Protected Mode, the base address, the limit, 
and the attributes are all updated per the contents of the 
segment descriptor indexed by the selector. 


Whenever a memory reference occurs, the segment 
descriptor register associated with the segment being 
used is automatically involved with the memory refer- 
ence. The 32-bit segment base address becomes a 
component of the physical address calculation, the 
32-bit limit is used for the limit-check operation, and the 
attributes are checked against the type of memory refer- 
ence requested. 


Control Register 


The AmM386DE microprocessor has a 32-bit control reg- 
ister, CRO, to hold the machine state of a global nature. 
This register, along with System Address Registers 
described in the next section, hold machine state that 
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affects all tasks in the system. To access the Control 
Registers, load and store instructions are defined. 


CRO: Machine Control Register (Includes 80286 
Machine Status Word) 


CRO, shown in Figure 5, contains six defined bits for 
control and status purposes. The low-order 16 bits of 
CRO are also known as the Machine Status Word 
(MSW) for compatibility with 80286 Protected Mode. 
LMSW and SMSW instructions are taken as special 
aliases of the load and store CRO operations, where only 
the low-order 16 bits of CRO are involved. For compati- 
bility with 80286 operating systems, the Am386DE 
microprocessor LMSW instructions work in an identical 
fashion to the LMSW instruction on the 80286 (i.e., it 
only operates on the low-order 16 bits of CRO and it 
ignores the new bits in CRO). New Am386DE micropro- 
cessor operating systems should use the MOV CRO, 
Reg instruction. 


The defined CRO bits are described below. 


R__ (Reserved, bit 4) 


This bit is Reserved for Future Use. When loading 
CRO care should be taken to not alter the value of 
this bit. 


TS (Task Switched, bit 3) 


TS is automatically set whenever a task switch 
operation is performed. If TS is set, a coprocessor 
ESCape op-code will cause a Coprocessor Not 
Available trap (Exception 7). The trap handler typi- 
cally saves a 387DX math coprocessor context 
belonging to a previous task, loads a 387DX math 
coprocessor state belonging to the current task, 
and clears the TS bit before returning to the fault- 
ing coprocessor op-code. 


EM (Emulate Coprocessor, bit 2) 


The Emulate coprocessor bit is set to cause all 
coprocessor op-codes to generate a Coprocessor 
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CRO 


0 indicates “Reserved for Future Use.” Do not define; see the Compatibility section on page 17. 


Figure 5. Control Register 0 


Not Available fault (Exception 7). It is reset to allow 
coprocessor op-codes to be executed on an 
actual 387DX math coprocessor (this is the default 
case after reset). Note that the WAIT op-code is 
not affected by the EM bit setting. 


MP (Monitor Coprocessor, bit 1) 


The MP bit is used in conjunction with the TS bit to 
determine if the WAIT op-code will generate a 
Coprocessor Not Available fault (Exception 7) 
when TS = 1. When both MP = 1 and TS = 1, the 
WAIT op-code generates a trap. Otherwise, the 
WAIT op-code does not generate a trap. Note that 
TS is automatically set whenever a task switch 
operation is performed. 


(Protection Enable, bit 0) 


The PE bit is set to enable the Protected Mode. If 
PE is reset, the processor operates again in Real 
Mode. PE may be set by loading MSW or CRO. PE 
canbe reset only by a load into CRO. Resetting the 
PE bit is typically part of a longer instruction 
sequence needed for proper transition from Pro- 
tected Mode to Real Mode. Note that for strict 
80286 compatibility, PE cannot be reset by the 
LMSW instruction. 


CR1: Reserved 
CR1 is reserved for future processors. 


PE 


System Address Registers 


47 32-Bit Linear Base Address 1615 Limit 


System Segment 
eee k 


32- “—~sGhineasasAgies.  a0BNSeonenlank Anne” Linear Base Address 
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System Address Registers 


Four special registers are defined to reference the 
tables or segments supported by the 80286 CPU and 
Am386DE microprocessor protection model. 


These tables or segments are: 


GDT (Global Descriptor Table) 

IDT (Interrupt Descriptor Table) 

LDT (Locat Descriptor Table) 

TSS (Task State Segment) 
The addresses of these tables and segments are stored 
in special registers, the System Address and System 
Segment Registers illustrated in Figure 6. These regis- 
ters are named GDTR, IDTR, LDTR, and TR, respec- 


tively. The Protected Mode Architecture section 
describes the use of these registers. 


GDTR and IDTR 
These registers hold the 32-bit linear base address and 
16-bit limit of the GDT and IDT, respectively. 


The GDT and IDT segments, since they are global to all 
tasks in the system, are defined by 32-bit linear 
addresses and 16-bit limit values. 


0 


Plier ei Sak sh Saas ase all ae Registers (Automatically Loaded) 


32-Bit Segment Limit Attributes 


ae 
LDTR | Selector —_ | 


Figure 6. System Address and System Segment Reglsters 
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LDTR and TR 


These registers hold the 16-bit selector for the LDT 
descriptor and the TSS descriptor, respectively. 


The LDT and TSS segments, since they are task-specific 
segments, are defined by selector values stored in the 
system segment registers. Note that a segment descrip- 
tor register (programmer-invisible) is associated with 
each system segment register. 


Debug Registers 


Debug Registers: The six programmer accessible 
debug registers provide on-chip support for debugging. 
Debug Registers DR3-DRO specify the four linear 
breakpoints. The Debug Control Register DR7 is used 
to set the breakpoints, and the Debug Status Register 
DR6 displays the current state of the breakpoints. The 
use of the debug registers is described in the Debugging 
Support section. Figure 7 shows the Debug registers. 
Reglster Accessibility 


There are a few differences regarding the accessibility 
of the registers in Real and Protected Mode. Table 1 


ai Debug Registers . 
i DRO 

i DR1 

i DR2 

i DR3 

. ine. DR4 

: ine. DR5 

i DR6 

OR7 


Linear Breakpoint Address 0 
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Figure 7. Debug Registers 


summarizes these differences. See the Protected Mode 
Architecture section for further details. 


Compatibility With Future Processors 


In the preceding register descriptions, note certain 
Am386DE microprocessor register bits are Reserved 
for Future Use. When reserved bits are called out, treat 
them as fully undefined. This is essential for software 
compatibility with future processors! Follow the guide- 
lines below: 


1. Do not depend on the state of any undefined bits 
when testing the values of defined register bits. Mask 
them out when testing. 


2. Do not depend on the state of any undefined bits 
when storing them to memory or another register. 


3. Do not depend on the ability to retain information 
written into any undefined bits. 


4. When loading registers, always load the undefined 
bits as zeros. 


5. However, registers that have been previously stored 
may be reloaded without masking. 


Depending upon the values of undefined register bits 
will make your software dependent upon the unspeci- 
fied Am386DE microprocessor handling of these bits. 
Depending on undefined values risks making software 
incompatible with future processors that define usages 
for the AM3886DE CPU undefined bits. Avoid any soft- 
ware dependence upon the state of undefined 
Am386DE CPU register bits. 


Table 1. Register Usage 


Yes 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Note: 


Use In Use In 
Real Mode Protected Mode 
Register Load 


General Registers | Yes | 
Segment Registers | Yes | 


[Store 
ee 
[Yes | Yes _ 
[Yes | Yes 
a 
Ce 
[Ne [_Ne 


Debug Control [Yes | 


Load 


[Store _| 


PL = 0: The registers can be accessed only when the current privilege level is zero. 
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Instruction Set 


Instruction Set Overview 


The instruction set is divided into nine categories of 
operations. 


Data Transfer 

Arithmetic 

Shift/Rotate 

String Manipulation 

Bit Manipulation 

Control Transfer 

High Level Language Support 
Operating System Support 
Processor Control 


These Am386DE microprocessor instructions are listed 
in Table 2. 


All Am386DE microprocessor instructions operate on 
either 0, 1, 2, or 3 operands where an operand resides in 
a register in the instruction itself or in memory. Most zero 
operand instructions (e.g., CLI, STI) take only one byte. 
One operand instructions generally are two bytes long. 
The average instruction is 3.2-bytes long. Since the 
Am386DE device has a 16-byte instruction queue, an 
average of 5 instructions will be prefetched. The use of 
two operands permits the following types of common 
instructions. 


Register to Register 
Memory to Register 
Immediate to Register 
Register to Memory 
Immediate to Memory 


The operands can be either 8-, 16-, or 32-bits long. Asa 
general rule, when executing code written for the 
Am386DE microprocessor (32-bit code), operands are 
8 or 32 bits; when executing existing 80286 or 8086 
code (16-bit code), operands are 8 or 16 bits. Prefixes 
can be added to instructions that override the default 
length of the operands (i.e., use 32-bit operands for 
16-bit code or 16-bit operands for 32-bit code). 


Addressing Modes 


Addressing Modes Overview 


The Am386DE microprocessor provides a total of 11 
addressing modes for instructions to specify operands. 
The addressing modes are optimized to allow the effi- 
cient execution of high-level languages such as C and 
FORTRAN, and they cover the vast majority of data ref- 
erences needed by high-level languages. 


PRELIMINARY 


Register and Immediate Modes 


Two of the addressing modes provide for instructions 
that operate on register or immediate operands: 


Register Operand Mode: The operand is located in 
one of the 8-, 16-, or 32-bit general registers. 


Immediate Operand Mode: The operand is included in 
the instruction as part of the op-code. 


32-Bit Memory Addressing Modes 


The remaining nine modes provide a mechanism for 
specifying the effective address of an operand. The 
physical address consists of two components: the seg- 
ment base address and an effective address. The effec- 
tive address is calculated by using combinations of the 
following four address elements. 


Displacement: An 8- or 32-bit immediate value follow- 
ing the instruction. 


Base: The contents of any general-purpose register. 
The Base registers are generally used by compilers to 
point to the start of the local variable area. 


Index: The contents of any general-purpose register 
except for ESP. The Index registers are used to access 
the elements of an array, or a string of characters. 


Scale: The index register's value can be multiplied by a 
scale factor of either 1, 2, 4, or 8. Scaled index mode is 
especially usetul for accessing arrays or structures. 


Combinations of these four components make up the 
nine additional addressing modes. There is no perfor- 
mance penalty for using any of these addressing com- 
binations, since the effective address calculation is 
pipelined with the execution of other instructions. 


The one exception is the simultaneous use of Base and 
Index components that requires one additional clock. 


As shown in Figure 8, the effective address (EA) of an 
operand is calculated according to the following formula: 


EA = Base Reg + (Index Reg - Scaling) + Displacement 
Direct Mode: The operand’s offset is contained as part of 
the instruction as an 8-, 16-, or 32-bit displacement. 


Example: INC Word PTR [500] 


Register Indirect Mode: A Base register contains the 
address of the operand. 


Example: MOV [ECX], EDX 


Based Mode: A Base register's contents is added to a 
Displacement to form the operands offset. 
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Example: MOV ECX, [EAX + 24] 


Index Mode: An Index register’s contents is added to a 
Displacement to form the operands offset. 


Example: ADD EAX, TABLE [ESI] 
Scaled Index Mode: An Index register's contents is mul- 
tiplied by a scaling factor that is added to a Displacement 
to form the operands offset. 


Example: MUL EBX, TABLE [ESI e 4], 7 


Based Index Mode: The contents of a Base register is 
added to the contents of an Index register to form the 
effective address of an operand. 


Example: MOV EAX, [ESI] [EBX] 


Based Scaled Index Mode: The contents of an Index 
register is multiplied by a Scaling factor and the result is 
added to the contents of a Base register to obtain the 
operands offset. 


Segment Registers 


Descriptor Registers 


[__Accoss Rights GS | 
TL Access Rights FS] 
FP] Access Fighis ES | 
rT Access Rights 0S | 
F[[__Aecess Rights CS | 
| Limit 


Base Address 


Figure 8. Addressing Mode Calculations 


Effective 
Address 


AMD an 


Example: MOV ECX, [EDX « 8] [EAX] 


Based Index Mode with Displacement: The contents of 
an Index Register and a Base register’s contents anda 
Displacement are all summed together to form the 
operand offset. 


Example: ADD EDX, [ESI] [EBP + 00FFFFFOH] 


Based Scaled Index Mode with Displacement: The 
contents of an Index register are multiplied by a Scaling 
factor; the result is added to the contents of a Base reg- 
ister and a Displacement to form the operand's offset. 


Example: MOV EAX, LOCALTABLE[EDI « 4] [EBP + 80} 


Index Register 


Scale 
1,2, 4,or8 


Displacement 
(In Instruction) 


Linear 
Address 


Target Address 


Selected 
Segment 


Segment Base Address 


18421A-010 
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Table 2. Am386DE Microprocessor Instructions 


Table 2a. Data Transfer 


Push all registers on stack 
Pop ail registers off stack 
Exchange operand register 


Translate 


[PUSH 
[POP [Pop operandof tack 
[FOPR | 
DCG | 


XLAT 


ARE 
ae) a) Re) 


MOVZX | Move byte or Word, Dword with zero extension 


MOVSX 
WOE 
oba 


Input/Output 
Input operand from I/O space 


OUT Output operand to I/O space 


TEA 
(05 
LES 
LFS 
iGs 
iss 
LAH 
SAF 
PUSHF 
POPF 
PUSHFD | Push EFLAGS onto stack 
POPFD 
cic 
cub 
omc 
STC 
STO 
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Table 2b. Arithmetic Instructions 


ADD 
DAA Decimal adjust for addition 


> 
9 
(oe) 


INC 


> 
> 
> 


SBB 


DAS Decimal adjust for subtraction 
AAS ASCIil adjust for subtraction 


MUL 
MUL 


Divide unsigned 
IDIV Integer divide 
ASCII adjust before division 


Table 2c. String Instructions 


MOVS | Move byte or Word, Dword string 
Input string from 1/O space 
EP 


REPE/ 

REPZ_ | Repeat white equal/zero 
RENE/ 

REPNZ | Repeat while not equal/not zero 
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Table 2. Am386DE Microprocessor Instructions (continued) 


Table 2d. Logical Instructions 


NOT 
XOR 


“NOT" operand 
“AND” operands 
“Inclusive OR” operands 


“Exclusive OR” operands 
TEST “Tast” operands 


SHL/SHR_ | Shift logical left or right 


SAL/SAR_ | Shift arithmetic left or right 


SHLD/ 
SHRD Double shift left or right 


ROL/ROR | Rotate left/right 


RCL/RCR | Rotate through carry left/right 


Table 2e. Bit Manipulation Instructions 


B 


it Tast and Complement 
Bit Scan Forward 


Table 2f. Program Control Instructions 


SETOG 
TRIN 
IRENE 
TBIINRE 
IBEINA 


BT 
BTS 
BTR 
BTC 
SF 


B 


J Jump if carry 
JEMZ Jump if equal/zero 
JG/NLE Jump if greater/not less nor equal 


JGE/JNL 
JL/AINGE 
JLE/ING 
J 
J 


Jump if not overflow 


Jump if sign 


iC 
N' 
JNE/JNZ 
N' 
N 
le) 


Cc 

le) 
JNP/JPO 
JNS 
J 
JPAPE 
Js 
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Table 2f. Program Control Instructions 
(continued) 


Unconditional Transfers 
CALL Call procedurefask 
RET Return from procedure 


Iteration Controls 


[OOP 


LOOPE/ 
LOOPZ Loop if equal/zero 
Loop if not equal/not zero 


LOOPNE/ 
JUMP if register CX = 0 


LOOPNZ 
NT [ ener CS 


JCOXZ 
Interrupt if overflow 
Return from interrupttask 
Clear interrupt enable 
Set interrupt enable 


Table 2g. High Level Language Instructions 


BOUND | Check array bounds 
ENTER | Setup parameter block for entering procedure 
LEAVE 


Table 2h. Protection Model 


ARPL 
LAR 
LS 


VERR/ 
Verify segment for reading or writing 


VERW 
LMSW Load machine status word (lower 16 bits 
of CRO) 


Store machine status word 


INTO 
(RET 
CLI 
STl 


SGDT 
SIOT 
STR 
SLOT 
LGDT 
LIDT 
LTR 
LLDT 


SMSW 


Table 21. Processor Control Instructions 


Wait until BUSY negated 


Lock Bus 


HLT 
WAIT 


ESC 
LOCK 
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Differences Between 16- and 32-Bit Addresses 


In order to provide software compatibility with the 80286 
and the 8086, the Am386DE microprocessor can 
execute 16-bit instructions in Real and Protected 
Modes. The processor determines the size of the 
instructions it is executing by examining the D bit in the 
CS segment descriptor. If the D bit is 0 then all operand 
lengths and effective addresses are assumed to be 16 
bits long. If the D bit is 1 then the default length for oper- 
ands and addresses is 32 bits. In Real Mode, the default 
size for operands and addresses is 16 bits. 


Regardless of the default precision of the operands or 
addresses, the Am386DE microprocessor is able to 
execute either 16- or 32-bit instructions. This is specified 
via the use of override prefixes. Two prefixes, the Oper- 
and Size Prefix and the Address Length Prefix, override 
the value of the D bit on an individual instruction basis. 


Example: The processor is executing in Real Mode and the 
programmer needs to access the EAX registers. The assem- 
bler code for this might be MOV EAX, 32-bit MEMORYOP. An 
assembler automatically determines that an Operand Size 
Prefix is needed and generates it. 

Example: The D bit is 0 and the programmer wishes to use 
Scaled Index addressing mode to access an array. The 
Address Length Prefix allows the use of MOV DX, 
TABLE [ESI ¢ 2]. The assembler uses an Address Length Pre- 
fix, since with D = 0, the default addressing mode is 16 bits. 
Example: The D bitis 1 and the program wants to store a 16-bit 
quantity. The Operand Length Prefix is used to specify only a 
16-bit value: MOV MEM16, DX. 


The Operand Length and Address Length prefixes can 
be applied separately or in combination to any instruc- 
tion. The Address Length Prefix does not allow 
addresses over 64 Kbytes to be accessed in Real Mode. 
A memory address exceeding FFFFH will result in a 
General Protection Fault. An Address Length Prefix only 
allows the use of the additional Am386DE microproces- 
sor addressing modes. 


When executing 32-bit code, the Am386DE micropro- 
cessor uses either 8- or 32-bit displacements, and any 
register can be used as base or index registers. When 
executing 16-bit code, the displacements are either 8 or 
16 bits, and the base and index register conform to the 
80286 model. Table 3 illustrates the differences. 
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Data Types 


The Am386DE microprocessor supports ail data types 
commonly used in high-level languages. 


Bit: A single bit quantity. 


Bit Field: A group of up to 32 contiguous bits that spans a 
maximum of four bytes. 


Bit String: A set of contiguous bits on the Am386DE 
microprocessor bit strings can be up to 4 Gbytes long. 


Byte: A signed 8-bit quantity. 
Unsigned Byte: An unsigned 8-bit quantity. 
Integer (Word): A signed 16-bit quantity. 


Long Integer (Double Word): A signed 32-bit quantity. All 
operations assume a 2's complement representation. 


Unsigned Integer (Word): An unsigned 16-bit quantity. 


Unsigned Long Integer (Double Word): An unsigned 
32-bit quantity. 


Signed Quad Word: A signed 64-bit quantity. 
Unsigned Quad Word: An unsigned 64-bit quantity. 


Oftset: A 16- or 32-bit offset only quantity that indirectly 
references another memory location. 


Pointer: A full pointer which consists of a 16-bit segment 
selector and either a 16- or 32-bit offset. 


Char: A byte representation of an ASCII alphanumeric 
or contro! character. 


String: A contiguous sequence of bytes, words, or 
Dwords. A string may contain between 1 byte and 4 
Gbytes. 


BCD: A byte (unpacked) representation of decimal 
digits 0-9. 

Packed BCD: A byte (packed) representation of two 
decimal digits 0-9 storing one digit in each nibble. 


When the Am386DE microprocessor is coupled with a 
387DX math coprocessor then the following common 
floating-point types are supported. 


Table 3. Base and Index Registers for 16- and 32-Blt Addresses 


Re 4 oe | 16-Bit Addressing 32-Bit Addressing 


Base Register 
Index Register 


Scale Factor 
Displacement 


Any 32-bit GP Register 
Any 32-bit GP Register 


Except ESP 


1,2, 4,8 
0, 8, 32 bits 
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Floating Point: A signed 32-, 64-, or 80-bit real number Figure 9 illustrates the data types supported by the 
representation. Floating-point numbers are supported Am386DE microprocessor and a 387DX compatible 


by a 387DX compatible math coprocessor. math coprocessor. 
it 0 7 i) 
Signed Unsigned 
Byte Byta 
Sign Bit JE = 
sea Magnitude 


1514 


15 +1 0 0 
Signed Unsigned 
Word Word 
Sign Bit J comMsB ll ne 


<——-"ikeai Magnitude 
31 +3 +2 +1 0 


Signed 
Double 
Word 


Sign Bit J, Luss 


Unsigned 
Double 
Word 


Magnitude 


5 ' 
63 *” 6 up47 * +4 325, *9 72 i615 * oo 


Signed 
Quad 
Word 


Sign Bit J, LMSB 


Magnitude 
7 +N 0 07 0 0 
Binary 
Coded eee 
Decimal 
(BCD) BCD Digit N BCD Digit 1 BCD Digit 0 
07 0 0 


7 +N C1) 7 
jeoee 


ASCII Character N ASCII Character 1 ASCII Character 0 


7 +N 0 7 +1 07 9 0 
Packed 
BCD 


Most Significant Digit Least Significant Digit 


+N +1 


FAIS 7115 QO 7/18 


Byte 


Figure 9. Supported Data Types 18421A-014 
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+2 Gbits -2 Gbits 
210 
Bit 
String 
Bit 0 
31 +3 +2 +1 i?) 0 


Short 
32-Bit 
Pointer 


47 +5 +4 +3 +2 +1 0 0 


Long 
48-Bit 
Pointer 


73 i?) 


Floating 
Point* 


Sign Bit J t i ' 
Exponent Magnitude 


+5 +4 +3 +2 +1 0 


32-Bit 
Bit Field 


}¢——_____________ BitFied ____....._g/ 
1 to 32 Bits 


Note: 


* Supported by 387DX-compatible math coprocessor. 18421A-012 


Figure 9. Supported Data Types (continued) 
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offset. A selector is the contents of a segment register. 


Memory Organization 


Introduction 


Memory on the Am386DE microprocessor is divided up 
into 8-bit quantities (Bytes), 16-bit quantities (Words), 
and 32-bit quantities (Dwords). Words are stored in two 
consecutive bytes in memory with the low-order byte at 
the lowest address, the high-order byte at the highest 
address. Dwords are stored in four consecutive bytes in 
memory with the low-order byte at the lowest address, 
the high-order byte at the highest address. The address 
of a word or Dword is the byte address of the low-order 
byte. 


In addition to these basic data types, the Am386DE 
microprocessor supports larger variable length units 
with segmentation. Memory can be divided up into one 
or more variable length segments, which can be shared 
between programs. Segmentation is useful for organiz- 
ing memory in logical modules, and as such is a too! for 
the application programmer. 


Address Spaces 

The Am386DE microprocessor has two distinct address 
spaces: logical and physical. A logical address (also 
known as a virtual address) consists of a selector and an 


Effective Address Calculation 


Logical Address Segmentation 
14 Unit 


Selector 


- Descriptor Index 
Sagment Register 


An offset is formed by summing all of the addressing 
components (Base, Index, Displacement) discussed in 
Section Memory Address Modes into an effective 
address. Each task on AmM386DE CPU has a maximum 
of 16K (2'4 -1) selectors. 


The segmentation unit translates the logical address 
space into a 32-bit physical address space. The physi- 
cal address is what appears on the address pins. The 
total address space of the Am386DE is 4 Gbytes. 


The primary difference between Real Mode and Pro- 
tected Mode is how the segmentation unit performs the 
translation of the logical address into the physical 
address. In Real Mode, the segmentation unit shifts the 
selector left four bits and adds the result to the offset to 
form the physical address. While in Protected Mode, 
every selector has a base address associated with it. 
The base address is stored in one of two operating sys- 
tem tables (i.e., the Loca! Descriptor Table or Global 
Descriptor Table). The selector’s linear base address is 
added to the offset to form the final physical address. 


Figure 10 shows the relationship between the address 
spaces. 


Physical 
Memory 


Physical 
Address 


18421A-013 


Figure 10. Address Translation 
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Segment Register Usage 


The main data structure used to organize memory is the 
segment. On the Am386DE microprocessor, segments 
are variable sized blocks of physical addresses that 
have certain attributes associated with them. There are 
two main types of segments: code and data. The seg- 
ments are of variable size and can be as small as 1 byte 
or as large as 4 Gbytes (292 bytes). 


In order to provide compact instruction encoding and 
increase processor performance, instructions do not 
need to explicitly specify which segment register is 
used, A default segment register is automatically cho- 
sen according to the rules of Table 4 (Segment Register 
Selection Rules). In general, data references use the 
selector contained in the DS register; Stack references 
use the SS register; and Instruction fetches use the CS 
register. The contents of the Instruction Pointer provides 
the offset. Special segment override prefixes allow the 
explicit use of a given segment register, and override the 
implicit rules listed in Table 4. The override prefixes also 
allow the use of the ES, FS, and GS segment registers. 


There are no restrictions regarding the overlapping of 
the base addresses of any segments. Thus, all 6 seg- 
ments could have the base address set to zero and 
create a system with a 4-Gb physical address space. 
This creates a system where the virtual address space is 
the same as the physical address space. Further details 
of segmentation are discussed in the Protected Mode 
Architecture section on page 36. 


VO Space 


The Am386DE microprocessor has two distinct physical 
address spaces: Memory and I/O. Generally, peripher- 
als are placed in I/O space although the Am386DE CPU 
also supports memory-mapped peripherals. The I/O 
space consists of 64 Kbytes and can be divided into 64K 
8-bit ports, 32K 16-bit ports, or 16K 32-bit ports, or any 
combination of ports that add up to less than 64 Kbytes. 
The 64-Kbyte I/O address space refers to physical 
memory since I/O instructions do not go through the 
segmentation hardware. The M/IO pin acts as an addi- 
tional address line, thus allowing the system designer to 
easily determine which address space the processor is 
accessing. 


The 1/O ports are accessed via the IN and OUT (I/O 
instructions, with the port address supplied as an 
immediate 8-bit constant in the instruction or in the OX 
register. All 8- and 16-bit port addresses are zero 
extended on the upper address lines. The I/O instruc- 
tions cause the M/10 pin to be driven Low. 


/O port addresses 0OF8H through OOFFH are reserved. 


Interrupts 


Interrupts and Exceptions 


Interrupts and exceptions alter the normal program flow, 
in order to handle external events, to report errors or 
exceptional conditions. The difference between inter- 
rupts and exceptions is that interrupts are used to 


Table 4. Segment Register Selection Rules 


Type of Memory Reference Implied (Default) Segment Use Segment Override Prefixes Possible 
Code Fetch re 


Destination of PUSH, PUSHF, INT, 

CALL, PUSHA Instructions 

Source of POP, POPA, POPF, 

IRET, RET Instructions 

sfossrer us feructens es 


TOS, REP MOVS Instructions 
(Dl is Base Register) 


Other Data References with 
Effective Address Using Base 


Register of: 
[EAX] CS, SS, ES, FS, GS 
[EBX] CS, SS, ES, FS, GS 
{ECX] CS, SS, ES, FS, GS 
{EDX) CS, SS, ES, FS, GS 
[ESI] CS, SS, ES, FS, GS 
{EDI} CS, SS, ES, FS, GS 


[EBP] 
{ESP] 


CS, SS, ES, FS, GS 
CS, SS, ES, FS, GS 
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handle asynchronous external events while exceptions 
handle instruction faults. Although a program can gen- 
erate a software interrupt via an INT n instruction, the 
processor treats software interrupts as exceptions. 


Hardware interrupts occur as the result of an external 
event and are classified into two types: maskable or 
non-maskable. Interrupts are serviced after the execu- 
tion of the current instruction. After the interrupt handler 
is finished servicing the interrupt, execution proceeds 
with the instruction immediately after the interrupted 
instruction. The differences between the interrupts are 
discussed in the Maskable Interrupt and Non-Maskable 
Interrupt sections. 


Exceptions are classified as faults, traps, or aborts 
depending on the way they are reported, and whether or 
not restart of the instruction causing the exception is 
supported. Faults are exceptions that are detected and 
serviced before the execution of the faulting instruction. 
A fault occurs in a virtual memory system when the pro- 
cessor references a segment that is not present. The 
operating system fetches the segment, and then the 
Am886DE microprocessor restarts the instruction. 
Traps are exceptions that are reported immediately after 
the execution of the instruction that caused the problem. 
User defined interrupts are examples of traps. Aborts 


are exceptions that do not permit the precise location of 
the instruction causing the exception to be determined. 
Aborts are used to report severe errors, such as a hard- 
ware error or illegal values in system tables. 


Thus, when an interrupt service routine has been com- 
pleted, execution proceeds from the instruction immedi- 
ately following the interrupted instruction. On the other 
hand, the return address from an exception fault routine 
will always point at the instruction causing the exception 
and include any leading instruction prefixes. Table 5 sum- 
marizes the possible interrupts for the AM386DE micro- 
processor and shows where the return address points. 


The Am386DE microprocessor has the ability to handle 
up to 256 different interrupts/exceptions. In order to ser- 
vice the interrupts, a table with up to 256 interrupt vec- 
tors must be defined. The interrupt vectors are simply 
pointers to the appropriate interrupt service routine. In 
Real Mode (see the Real Mode Introduction section on 
page 34), the vectors are 4 byte quantities, a Code Seg- 
ment plus a 16-bit offset; in Protected Mode, the inter- 
rupt vectors are 8 byte quantities that are put in an 
Interrupt Descriptor Table (see the Introduction section 
on page 36). Of the 256 possible interrupts, 32 are 
Reserved for Future Use, the remaining 224 are free to 
be used by the system designer. 


Table 5. Interrupt Vector Assignments 


Interrupt 
Number 


Divide Error DIV, IDIV 
Debug Exception 
NMI Interrupt 


One Byte Interrupt 


INT 2 of NMI 


NTO 
BOUND 


Interrupt on Overflow 

Array Bounds Check 

Invalid Op-Code 

Device Not Available 

Double Fault 

Coprocessor Segment Overrun 
Invalid TSS 10 
Segment Not Present 
Stack Fauk 

General Protection Fault 
Reserved for Future Use 
Reserved for Future Use 


7 ESC, WAIT 


12 
13 


S Ps = = 
2 4 
fo) 
3 
© 


Coprocessor Error ESC, WAIT 
Reserved for Future Use 17-31 
Two Byte Interrupt 0-255 INT n 


Note: 


Instructions Which Can Cause Exceptions 


Any instruction 


Any illegal instruction 


Any instruction that can generate an Exception 


JMP, CALL, IRET, INT Yes AULT 
Segment register instructions 
Stack references 

Any memory reference 


Return Address 
Points to 
Faulting 
Instruction 


~< 


FAULT 
RAP* 


es 
8 


< 
a 


RAP 
RAP 
AULT 
‘e AULT 
Yes ‘AULT 
BORT 
BORT 


< 


6 


a 


< 
7) 


e AULT 
Yes AULT 
‘AULT 


~< 
7) 


~< 
oO 
a 


an 
mal 


AULT 


RAP 


*Some debug exceptions may report both traps on the previous instruction and faults on the next instruction. 
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Interrupt Processing 
When an interrupt occurs the following actions happen. 


e First, the current program address and the Flags are 
saved on the stack to allow resumption of the 
interrupted program. 

e Next, an 8-bit vector is supplied to the Am386DE 
microprocessor that identifies the appropriate entry 
in the interrupt table. The table contains the starting 
address of the interrupt service routine. Then, the 
user supplied interrupt service routine is executed. 

° Finally, when an IRET instruction is executed the old 
processor state is restored and program execution 
resumes at the appropriate instruction. 

The 8-bit interrupt vector is supplied to the AM386DE 

microprocessor in several different ways: exceptions 

supply the interrupt vector internally; software INT 
instructions contain or imply the vector; maskable hard- 
ware interrupts supply the 8-bit vector via the interrupt 
acknowledge bus sequence. Non-Maskable hardware 
interrupts are assigned to interrupt vector 2. 


Maskable Interrupt 


Maskable interrupts are the most common way the 
Am386DE microprocessor responds to asynchronous 
external hardware events. A hardware interrupt occurs 
when the INTR is pulled High and the Interrupt Flag bit 
(IF) is enabled. The processor only responds to inter- 
rupts between instructions (REPeat String instructions 
have an interrupt window between memory moves, 
which allows interrupts during long string moves). When 
an interrupt occurs, the processor reads an 8-bit vector 
supplied by the hardware that identifies the source of the 
interrupt (one of 224 user defined interrupts). The exact 
nature of the interrupt sequence is discussed in the 
Functional Data section on page 52. 


The IF bit inthe EFLAGS register is reset when an inter- 
rupt is being serviced. This effectively disables servicing 
additional interrupts during an interrupt service routine. 
However, the IF bit may be set explicitly by the interrupt 
handler to allow the nesting of interrupts. When an [RET 
instruction is executed, the original state of the IF bit is 
restored. 


Non-Maskable Interrupt 


Non-maskable interrupts provide a method of servicing 
very high priority interrupts. A common example of the 
use of a non-maskable interrupt (NMI) would be to acti- 
vate a power failure routine. When the NMI input is 
pulled High it causes an interrupt with an internally sup- 
plied vector value of 2. Unlike a normal hardware inter- 
rupt, no interrupt acknowledgment sequence is 
performed for NMI. 


While executing the NMI servicing procedure, the 
Am386DE microprocessor will not service further NMI 
requests until an interrupt return (IRET) instruction is 
executed or the processor is reset. If NMI occurs while 
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currently servicing an NMI, its presence will be saved for 
servicing after executing the first RET instruction. The 
IF bit is cleared at the beginning of an NMI interrupt to 
inhibit further INTR interrupts. 


Software Interrupts 


A third type of interrupt/exception for the Am386DE 
microprocessor is the software interrupt. An INT n 
instruction causes the processorto execute the interrupt 
service routine pointed to by the nth vector in the inter- 
rupt table. 


Aspecial case of the two byte software interrupt INT nis 
the one byte INT 3 or breakpoint interrupt. By inserting 
this one byte instruction in a program, the user can set 
breakpoints in the program as a debugging tool. 


A final type of software interrupt is the single step inter- 
rupt. Itis discussed in the Debugging Support section on 
page 30. 


Interrupt and Exception Priorities 


Interrupts are externally-generated events. Maskable 
Interrupts (on the INTR input) and Non-Maskable Inter- 
tupts (on the NMI input) are recognized at instruction 
boundaries. When NMI and maskable INTR are both 
recognized at the same instruction boundary, the 
Am386DE microprocessor invokes the NMI service rou- 
tine first. If after the NMi service routine has been 
invoked, maskable interrupts are still enabled, then the 
Am386DE CPU invokes the appropriate interrupt ser- 
vice routine. 


Table 6. Am386DE Microprocessor Priority for 
Invoking Service Routines In Case of 
Simultaneous External Interrupts 


1.NMI 
Exceptions are internally-generated events. Exceptions 
are detected by the Am386DE microprocessor if in the 
course of executing an instruction, the AmM386DE CPU 
detects a problematic condition. The Am386DE micro- 
processor then immediately invokes the appropriate 
exception service routine. The state of the Am386DE 
CPU is such that the instruction causing the exception 
can be restarted. If the exception service routine has 


taken care of the problematic condition, the instruction 
will execute without causing the same exception. 


It is possible for a single instruction to generate several 
exceptions. However, only one exception is generated 
upon each attempt to execute the instruction. Each 
exception service routine should correct its correspond- 
ing exception, and restart the instruction. In this man- 
ner, exceptions are serviced until the instruction 
executes successfully. 
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As the Am386DE microprocessor executes instruc- 
tions, it follows a consistent cycle in checking for excep- 
tions. This cycle is detailed in the sequence of Exception 
Checking and is repeated as each instruction is 
executed and occurs in parallel with instruction decod- 
ing and execution. 


Exception Checking Sequence 
Instruction Restart 


The Am386DE microprocessor fully supports restarting 
all instructions after faults. If an exception is detected in 
the instruction to be executed (Exception Categories 4 
through 9 in the Sequence of Exception Checking), the 
Am386DE device invokes the appropriate exception 
service routine. 


Consider the case of the Am386DE microprocessor 
having just completed an instruction. It then performs 
the following checks before reaching the point where 
the next instruction is completed: 


1. Check for Exception 1 Traps from the instruction just 
completed (single-step via Trap Flag or Data 
Breakpoints set in the Debug Registers). 


2. Check for Exception 1 Faults in the next instruction 
(Instruction Execution Breakpoint set in the Debug 
Registers for the next instruction). 


3. Check for external NMI and INTR. 


4. Check for Segmentation Faults that prevented 
fetching the entire next instruction (Exceptions 11 
and 13). 


5. Check for Faults decoding the next instruction 
[Exception 6 if illegal op-code; Exception 6 if in Real 
Mode and attempting to execute an instruction for 
Protected Mode only (see the Protection section on 
page 45 and the I/O Permission Bitmap section on 
page 46); or Exception 13 if instruction is longer 
than 15 bytes, or privilege violation in Protected 
Mode (i.e., not at IOPL or at CPL = 0)]. 

6. If WAIT op-code, check if TS =1 and MP =1 
(Exception 7 if both are 1). 

7. \IfESCAPE op-code for numeric coprocessor, check 
if EM = 1 or TS = 1 (Exception 7 if either are 1). 

8. If WAIT op-code or ESCAPE op-code for numeric 
coprocessor, check ERROR input signal (Exception 
16 if ERROR input is asserted). 


9. Check each memory reference required by the 
instruction for Segmentation Faults that prevent 
transferring the entire memory quantity (Exceptions 
11, 12, 13). 


Double Fault 


A Double Fault (Exception 8) results when the proces- 
sor attempts to invoke an exception service routine for 
the segment exceptions (10, 11, 12, or 13), but in the 
process of doing so, detects an exception. 


When a Double Fault occurs, the Am386DE micropro- 
cessor invokes the exception service routine for 
Exception 8. 


Reset and Initialization 


When the processor is initialized or Reset, the registers 
have the values shown in Table 7. The Am386DE micro- 
processor will then start executing instructions near the 
top of physical memory, at location FFFFFFFOH. When 
the first Inter-Segment Jump or Call is executed, 
address lines A31—A20 will drop Low for CS-relative 
memory cycles, and the Am386DE microprocessor will 
only execute instructions in the lower 1 Mb of physical 
memory. This allows the system designer to use a ROM 
at the top of physical memory to initialize the system and 
take care of Resets. 


RESET forces the Am386DE microprocessor to termi- 
nate all execution and local bus activity. No instruction 
execution or bus activity will occur as long as Reset is 
active. Between 350 and 450 CLK2 periods after Reset 
becomes inactive, the Am386DE device will start 
executing instructions at the top of physical memory. 


Tabie 7. Register Values after Reset 


Flag Word UUUU0002H 
Machine Status Word (CRO)] UUUUUUU0H 
Instruction Pointer OOOOFFFOH 
Code Segment FOOOH 
Data Segment 0000H 
Stack Segment 0000H 
0000H 
0000H 


0000H 
Component and 
Stepping ID 


Undefined 


Extra Segment (ES) 
Extra Segment (FS) 
Extra Segment (GS) 
DX Register 


All Other Registers 


Notes: 


1. EFLAGS Register. The upper 14 bits of the EFLAGS register are 
undefined, RF (Bit 16) and 0 (Bit 15) are all other defined flag bits, 

2. CRO: (Machine Status Word). All of the defined fields in the CRO 
ara 0 (TS Bit 3, EM Bit 2, MP Bit 1, and PE Bit 0). 

3. Thecode Segment Register (CS) will have its Base Address sat to 
FFFFOOOOH and Limit set to OFFFFH. 

4. Allundefined bits are Reserved for Future Use and should not 
be used. 


5. DX register always holds component and stepping identifier (see 
Saction Component and Revision Identitiers). EAX register holds 
self-test signature if self-tast was requested (see the Saif- Test Sig- 
nature section on page 88). 
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Self-Test 


The Am386DE microprocessor has the capability to per- 
form a self-test. The self-test checks the function of alt 
the Control ROM and most of the non-random logic of 
the part. Approximately one-half of the Am386DE micro- 
processor can be tested during self-test. 


Self-Test is initiated on the AmM386DE microprocessor 
when the RESET pin transitions from High to Low, and 
the BUSY pin is Low. The self-test takes about 2'9 
clocks or approximately 26 ms with a 20-MHz Am386DE 
device. At the completion of self-test, the processor per- 
forms reset and begins normal operation. The part has 
successfully passed self-test if the contents of the EAX 
register are zero (0). If the results of EAX are not zero, 
then the self-test has detected a flaw in the part. 


Debugging Support 


The Am386DE microprocessor provides several fea- 
tures that simplify the debugging process. 


The three categories of on-chip debugging aids are: 


1. The code execution breakpoint op-code (QCCH); 

2. The single-step capability provided by the TF bit in 
the flag register; and, 

3. The code and data breakpoint capability provided by 
the Debug Registers DR3-DRO, DR6, and DR7. 
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Breakpoint instruction 


A single-byte op-code breakpoint instruction is avail- 
able for use by software debuggers. The breakpoint op- 
code is OCCh and generates an Exception 3 trap when 
executed. in typical use, a debugger program can plant 
the breakpoint instruction at all desired code execution 
breakpoints. The single-byte breakpoint op-code is an 
alias for the two-byte general software interrupt instruc- 
tion, INT n, where n = 3. The only difference between INT 
3 (OCCh) and INT nis that INT 3 is never |OPL-sensitive; 
but, INT n is |OPL-sensitive in Protected Mode. 
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Single-Step Trap 


Ifthe single-step flag (TF, bit 8) inthe EFLAGS register is 
found to be set at the end of an instruction, a single-step 
exception occurs. The single-step exception is auto vec- 
tored to Exception 1. Precisely, Exception 1 occurs as a 
trap after the instruction following the instruction that set 
TF. Intypical practice, adebugger sets the TF bit of a flag 
register image on the debugger's stack. It then typically 
transfers control to the user program and loads the flag 
image with a signal instruction, the IRET instruction. The 
single-step trap occurs after executing one instruction of 
the user program. 


Since the Exception 1 occurs as a trap (that is, it occurs 
after the instruction has already executed), the CS:EIP 
pushed onto the debugger’s stack points to the next 
unexecuted instruction of the program being debugged. 
An Exception 1 handler, merely by ending with an IRET 
instruction, can therefore efficiently support single- 
stepping through a user program. 


Debug Registers 


The Debug Registers are an advanced debugging fea- 
ture of the AM386DE microprocessor. They allow data 
access breakpoints as well as code execution break- 
points. Since the breakpoints are indicated by on-chip 
registers, an instruction execution breakpoint can be 
placed in ROM code or in code shared by several tasks, 
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neither of which can be supported by the INT 3 break- 
point op-code. 


The Am386DE microprocessor contains six Debug 
Registers, providing the ability to specify up to four dis- 
tinct breakpoint addresses, breakpoint control options, 
and read breakpoint status. Initially after reset, break- 
points will occur unless the debug registers are pro- 
grammed. Breakpoints set up in the Debug Registers 
are auto-vectored to Exception 1. 


Physical Address Breakpoint Registers 
(DR3-DRO) 

Up to four breakpoint addresses can be specified by writ- 
ing into Debug Registers DR3—DRO, shown in Figure 11. 
The breakpoint addresses specified are 32-bit physical 
addresses. AM386DE microprocessor hardware con- 
tinuously compares the physical breakpoint addresses 
in DR3—-DRO with the physical addresses generated by 
the executing software (the result of computing the 
effective address and adding the 32-bit segment base 
address). 


Debug Control Register (DR7) 

A Debug Control Register, DR7, shown in Figure 11, 
allows several debug control functions, such as enab- 
ling the breakpoints and setting up other control options 
for the breakpoints. The fields within the Debug Control 
Register, DR7, are as follows. 
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Note: 
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DRO 
DRI 
DR2 
DR3 
DR4 
DR5 


DR6& 


DR? 


0 indicates “Reserved for Future Use.” Do not define; see the Compatibility section on page 17. 


Figure 11. Debug Registers 


18421A-014 
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LENI (Breakpoint Length Specification Bits) 


A 2-bit LEN field exists for each of the four breakpoints. 
LEN specifies the length of the associated breakpoint 
field. The choices for data breakpoints are: 1 byte, 2 
bytes, and 4 bytes. Instruction execution breakpoints 
must have a length of 1 (LENi = 00; see Table 8). 


Table 8. LENI Field Encoding 


Usage of Least 
Significant Bits in 
Breakpoint Address 
Register i, (i = 0 — 3) 


All 32 bits used to specify 
a single-byte breakpoint 
field. 


A31-A1 used to specify 
a two-byte, word-aligned 
breakpoint field. AO in 
Breakpoint Address 
Register is not used. 


Breakpoint 
Field Width 


Undefined— 
do not use 
this ancoding 


A31-A2 used to specify 

a four-byte, Dword-aligned 
breakpoint field. AO and 
A1 in Breakpoint Address 
Register are not used. 


The LENi field controls the size of breakpoint field i by 
controlling whether all low-order physical address bits in 
the breakpoint address register are used to detect the 
breakpoint event. Therefore, ail breakpoint fields are 
aligned; 2-byte breakpoint fields begin on Word bound- 
aries and 4-byte breakpoint fields begin on Dword 
boundaries. 


Figure 12 shows various size breakpoint fields. Assume 
the breakpoint physical address in DR2 is O0000005H. 
In that situation, the following illustration indicates the 
region of the breakpoint field for lengths of 1, 2, or 4 
bytes. 


RWI (Memory Access Qualifler Bits) 


A 2-bit RW field exists for each of the four breakpoints. 
The 2-bit RW field specifies the type of usage that must 
occur in orderto activate the associated breakpoint (see 
Table 9). 


RW encoding 00 is used to set up an instruction execu- 
tion breakpoint. RW encodings 01 or 11 are used to set 
up write-only or read/write data breakpoints. 


Note that instruction execution breakpoints are taken as 
faults (i.e., before the instruction executes), but data 
breakpoints are taken as traps (i.e., after the data trans- 
fer takes place). 


Table 9. Associated Breakpoint Encoding 


RW Encoding Usage Causing Breakpoint 


Instruction execution only 


Data writes only 


Undefined—do not use this encoding 


Data reads and writes only 


Using LENi and RWI to Set Data Breakpoint | 


A data breakpoint can be set up by writing the physical 
address into DRi (i = 0-3). For data breakpoints, RWi 
can= 01 (write only) or 11 (write/read). LEN can = 00, 01, 
ort. 


if a data access falls entirely or partly within the data 
breakpoint field, the data breakpoint condition has 
occurred, and if the breakpoint is enabled, an Exception 
1 trap will occur. 


Using LENI and RWi to Set Instruction Execution 
Breakpoint i 

An instruction execution breakpoint can be set up by writ- 
ing address of the beginning of the instruction (including 
prefixes if any) into DRi (i = 0-3). RWi must = 00 and 
LEN must = 00 for instruction execution breakpoints. 


Figure 12. Breakpoint Fields 
LEN2 = 00B 


DR2 = 00000005H; 


00000008H 
00000004H 
00000000H 
DR2 = 00000005H; LEN2 = 01B 
31 0 
00000008H 
00000004H 
00000000H 
DR2 = 00000005H; LEN2 = 11B 
31 0 
00000008H 
bkpt fld2 —_> 00000004H 
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If the instruction beginning at the breakpoint address is 
about to be executed, the instruction execution break- 
point condition has occurred, and if the breakpoint is 
enabled, an Exception 1 fault will occur before the 
instruction is executed. 


Note that an instruction execution breakpoint address 
must be equal to the beginning byte address of an 
instruction (including prefixes) in order for the instruction 
execution breakpoint to occur. 


GD (Global Debug Register Access Detect) 


The Debug Registers can only be accessed in Real 
Mode or at privilege level 0 in Protected Mode. The GD 
bit, when set, provides extra protection against any 
Debug Register access even in Real Mode or at privi- 
lege level 0 in Protected Mode. This additional protec- 
tion feature is provided to guarantee that a software 
debugger or emulator can have full control over the 
Debug Register resources when required. The GD bit, 
when set, causes an Exception 1 fault if an instruction 
attempts to read or write any Debug Register. The GD bit 
is then automatically cleared when the Exception 1 han- 
dler is invoked, allowing the Exception 1 handler free 
access to the debug registers. 


GE and LE (Exact Data Breakpoint Match, Global 
and Local) 

if either GE or LE is set, any data breakpoint trap will be 
reported exactly after completion of the instruction that 
caused the operand transfer. Exact reportingis provided 
by forcing the Am386DE microprocessor execution unit 
to wait for completion of data operand transfers before 
beginning execution of the next instruction. 


If exact data breakpoint match is not selected, data 
breakpoints may not be reported until several instruc- 
tions later or may not be reported at all. When enabling a 
data breakpoint, it is therefore recommended to enable 
the exact data breakpoint match. 


When the Am386DE microprocessor performs a task 
switch, the LE bit is cleared. Thus, the LE bit supports 
fast task switching out of tasks that have enabled the 
exact data breakpoint match for their task-local break- 
points. The LE bit is cleared by the processor during a 
task switch to avoid having exact data breakpoint match 
enabled inthe new task. Note that exact data breakpoint 
match must be re-enabled under software control. 


The Am386DE microprocessor GE bit is unaffected dur- 
ing a task switch. The GE bit supports exact data break- 
point match that is to remain enabled during all tasks 
executing in the system. 


Note that instruction execution breakpoints are always 
reported exactly, whether or not exact data breakpoint 
match is selected. 


Gl and Li (Breakpoint Enable, Global and Local) 

If either Gior Liis set, then the associated breakpoint (as 
defined by the physical address in DRi, the length in 
LENi and the usage criteria in RWi) is enabled. If either 
Gi or Li is set and the Am386DE microprocessor 
detects the breakpoint condition, then the Exception 1 
handler is invoked. 


When the Am386DE microprocessor performs a task 
switch to a new Task State Segment (TSS), all Libits are 
cleared. Thus, the Li bits support fast task switching out 
of tasks that use some task-local breakpoint registers. 
The Li bits are cleared by the processor during a task 
switch to avoid spurious exceptions in the new task. 
Note that the breakpoints must be enabled under soft- 
ware control. 


All Am386DE microprocessor Gibits are unaffected dur- 


ing a task switch. The Gi bits support breakpoints that 
are active in all tasks executing in the system. 


Debug Status Register (DR6) 


A Debug Status Register, DR6, shown in Figure 11, 
allows the Exception 1 handler to easily determine why it 
was invoked. Note the Exception 1 handler can be 
invoked as a result of one of several events. 


DRO Breakpoint fault/rap. 
DRi Breakpoint faulttrap. 
DR2 Breakpoint faultArap. 
DR3 Breakpoint faultArap. 
Single-step (TF) trap. 
Task switch trap. 


Fault due to attempted debug register access when 
GD = 1. 

The Debug Status Register contains single-bit flags for 
each of the possible events invoking Exception 1. Note 
below that some of these events are faults (exception 
taken before the instruction is executed), while other 
events are traps (exception taken after the debug 
events occurred). 


NOAA PON = 


The flags in DR6 are set by the hardware but never 
cleared by hardware. Exception 1 handler software 
should clear DR6 before returning to the user program 
to avoid future confusion in identifying the source of 
Exception 1. 


The fields within the Debug Status Register DR6 are as 
follows. 


Bi (Debug Fault/Trap Due to Breakpoint 0-3) 

Four breakpoint indicator flags, B3-B0, correspond 
one-to-one with the breakpoint registers in DR3-DRO. A 
flag Bi is set when the condition described by DRi, LENi, 
and RWi occurs. 


If Gi or Liis set, and if the breakpoint is detected, the pro- 
cessor will invoke the Exception 1 handler. The exception 
is handled as a fault if an instruction execution breakpoint 
occurred or as a trap if a data breakpoint occurred. 
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important Note: A flag, Bi, is set whenever the hard- 
ware detects a match condition on enabled breakpointi. 
Whenever a match is detected on at least one enabled 
breakpoint i, the hardware immediately sets all Bi bits 
corresponding to breakpoint conditions matching at that 
instant, whether enabled or not. Therefore, the Excep- 
tion 1 handler may see that multiple Bi bits are set, but 
only set Bi bits corresponding to enabled breakpoints (Li 
or Gi set) are true indications of why the Exception 1 
handler was invoked. 


BD (Debug Fauit Due to Attempted Register Access 
When GD Bit Set) 

This bit is set if the Exception 1 handler was invoked due 
to an instruction attempting to read or write to the debug 
registers when GD bit was set. If such an event occurs, 
thenthe GD bit is automatically cleared when the Excep- 
tion 1 handler is invoked, allowing handler access to the 
debug registers. 


BS (Debug Trap Due to Single-Step) 

This bit is set if the Exception 1 handler was invoked due to 
the TF bit in the flag register being set (for single-stepping). 
See the Single-Step Trap section on page 31. 


BT (Debug Trap Due to Task Switch) 


This bit is set if the Exception 1 handler was invoked due 
to a task switch occurring to a task having an AM386DE 
microprocessor TSS with the T-bit set. (See Figure 26 
on page 47) Note the task switch into the new task 
occurs normally, but before the first instruction of the 
task is executed, the Exception 1 handler is invoked. 
With respect to the task switch operation, the operation is 
considered to be a trap. 
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Use of Resume Flag (RF) In Flag Register 


The Resume Flag (RF) in the flag word can suppress an 
instruction execution breakpoint when the Exception 1 
handler retums to a user program at a user address that 
is also an instruction execution breakpoint. See the 
Flags Register section on page 13. 


REAL MODE ARCHITECTURE 
Real Mode Introduction 


When the processor is reset or powered up, it is initial- 
ized in Real Mode. Real Mode has the same base archi- 
tecture as the 8086, but allows access to the 32-bit 
register set of the Am386DE microprocessor. The 
addressing mechanism, memory size, and interrupt 
handling are all identical to the Real Mode on the 80286. 


Most Am386DE microprocessor instructions are avail- 
able in Real Mode (except those instructions listed in the 
Protection section on page 45 and the I/O Permission 
Bitmap section on page 46). The default operand size in 
Real Mode is 16 bits, just like the 8086. In order to use 
the 32-bit registers and addressing modes, override 
prefixes must be used. !n addition, the segment size on 
the AmM386DE CPU in Real Mode is 64 Kbytes so 32-bit 
effective addresses must have a value less than 
OOOOFFFFH. The primary purpose of Real Mode is to 
set up the processor for Protected Mode Operation. 


Several instructions, particularly those applying to the 
multitasking model and protection model, are available 
only in Protected Mode. Therefore, attempting to 


Max Limit 
Fixed At 64K In 
Real Mode 


Selected 
Segment 
18421A-015 


Figure 13. Real Address Mode Addressing 
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execute the following instructions in Real Mode gener- 
ates an Exception 6 fault. 


LTR; STR; 
LLDT; SLDT; 
LAR; VERR ; 
LSL; VERW; 
ARPL. 

LOCK Operation 


Table 10 shows the only instruction forms where the 
LOCK prefix is legal on the Am386DE microprocessor. 


Table 10. Legal Lock Prefix Instruction Forms 


BIT TEST and 
SET/RESET/COMPLEMENT 


Mem, Reg/immed 
Reg, Mem 
Mem, Reg 


ADD, OR, ADC, SBB, AND, SUB, XOR| Mem, Reg/immed 
NOT, NEG, INC, DEC Mem 


An Exception 6 will be generated if a LOCK prefix is 
placed before any instruction form or op-code not listed 
above. The LOCK prefix allows indivisible read/modify/ 
write operations on memory operands using the instruc- 
tions above. For example, even the ADD Reg, Mem is 
not LOCKabie, because the Mem operand is not the 
destination (and therefore no memory read/modify/ 
operation is being performed). 


Since, on the Am386DE microprocessor, repeated 
String instructions are not LOCKable, it is not possible to 
LOCK the bus for a long period of time. Therefore, the 
LOCK prefix is not |OPL-sensitive on the Am386DE 
device. The LOCK prefix can be used at any privilege 
level, but only on the instruction forms listed above. 


Memory Addressing 


In Real Mode, the maximum memory size is limited to 
1 Mbyte. Thus, only address lines A19-A2 are active. 
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One exception is that the High address lines A31-A20 
are High during CS-relative memory cycles until an 
intersegment jump or call is executed (see Section 
Reset and Initialization). 


Physical addresses are formed in Real Mode by adding 
the contents of the appropriate segment register that is 
shifted lett by 4 bits to an effective address. This addition 
results in a physical address from 00000000H to 
0010FFEFH. This is compatible with 80286 Real Mode. 
Since segment registers are shifted left by 4 bits, this 
implies that Real Mode segments always start on 
16-byte boundaries. 


All segments in Real Mode are exactly 64 Kbytes long 
and may be read, written, or executed. The Am386DE 
microprocessor will generate an Exception 13 if a data 
operand or instruction fetch occurs past the end of a 
segment (i.e., if an operand has an offset greater than 
FFFFH; for example, a word with a low byte at FFFFH 
and the high byte at 0000H). 


Segments may be overlapped in Real Mode. Thus, ifa 
particular segment does not use all 64 Kbytes, 
another segment can be overlaid on top of the unused 
portion of the previous segment. This allows the 
programmer to minimize the amount of physical 
memory needed for a program. 


Reserved Locations 


There are two fixed areas in memory that are reservedin 
Real address mode: system initialization area and the 
interrupt table area. Locations 00000H through 003FFH 
are reserved for interrupt vectors. Each one of the 256 
possible interrupts has a 4-byte jump vector reserved for 
it. Locations FFFFFFFOH through FFFFFFFFH are 
reserved for system initialization. 


Interrupts 


Many of the exceptions shown in Table 5 and discussed 
in the Interrupts section on page 26 are not applicable to 
Real Mode operation; in particular, Exceptions 10 and 
11 will not happen in Real Mode. Other exceptions have 
slightly different meanings in Real Mode. Table 11 
identifies these exceptions. 


Table 11. Other Exceptions In Real Mode 


Interrupt 
Number 


Interrupt table limit too small 


CS, DS, ES, FS, GS 
Segment overrun exception 


Related Instructions 


beyond offset = FFFFH. 
An attempt to execute 
past the end of CS segment. 


Return Address Location 


INT Vector is not within table limit. Before Instruction 


Word memory reference 


Before Instruction 


SS Segment overrun exception Ba detec EFFFH, Before Instruction 
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Shutdown and Halt 


The HLT instruction stops program execution and pre- 
vents the processor from using the focai bus until 
restarted. Either NMI, FLT, INTR withinterupts enabled 
(IF = 1), or RESET will force the Am386DE micropro- 
cessor out of halt. If interrupted, the saved CS:IP will 
point to the next instruction after the HLT. 


Shutdown will occur when a severe error is detected that 
prevents further processing. In Real Mode, shutdown 
can occur under two conditions: 


@ An interrupt or an exception occurs (Exception 8 or 
13) and the interrupt vector is larger than the 
Interrupt Descriptor Table (i.e., there is not an 
interrupt handler for the interrupt); 


@® ACALL, INT, or PUSH instruction attempts to wrap 
around the stack segment when SP is not even (e.g., 
pushing a value on the stack when SP = 0001 
resulting in a stack segment greater than FFFFH). 

An NMI input can bring the processor out of shutdown if 
the Interrupt Descriptor Table limit is large enough to 
contain the NMI interrupt vector (at least 0017H) andthe 
stack has enough room to contain the vector and flag 
information (i.¢., SP is greater than 0005H). Otherwise 
shutdown can only be exited via the RESET input. 


PROTECTED MODE ARCHITECTURE 
Introduction 


The complete capabilities of the Am386DE micropro- 
cessor are unlocked when the processor operates in 


48/32 Bit Pointer 


Selector 


Segment 
Descriptor 


Segment Base 
Address 


Protected Address Mode (Protected Mode). Protected 
Mode vastly increases the physical address space to 4 
Gbytes (252 bytes). In addition, Protected Mode allows 
the AmM3886DE CPU to run all of the existing 8086 and 
80286 software, while providing a sophisticated 
memory management and a hardware-assisted protec- 
tion mechanism. Protected Mode allows the use of addi- 
tional instructions especially optimized for supporting 
multitasking operating systems. The base architecture 
of the Am386DE CPU remains the same; the registers, 
instructions, and addressing modes described in the 
previous sections are retained. The main differences 
between Protected Mode and Real Mode from a pro- 
grammer's view is the increased address space and a 
different addressing mechanism. 


Addressing Mechanism 


Like Real Mode, Protected Mode uses two components 
to form the logical address: a 16-bit selector is used to 
determine the linear base address of a segment; the 
base address is added to a 32-bit effective address to 
form a 32-bit physical address. 


The difference between the two modes lies in calculat- 
ing the base address. In Protected Mode, the selector is 
used to specify an index into an operating system 
defined table (see Figure 14). The table contains the 
32-bit base address of a given segment. The physical 
address is formed by adding the base address obtained 
from the table to the offset. 


Segment Limit 


Selected 
Segment 
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Figure 14. Protected Mode Addressing 
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Segmentation Introduction 


Segmentation is one method of memory management 
and provides the basis for protection. Segments are 
used to encapsulate regions of memory that have com- 
mon attributes. For example, all of the code of a given 
program could be contained in a segment or an operat- 
ing system table may reside in a segment. All informa- 
tion about a segment is stored in an 8-byte data 
structure called a descriptor. All of the descriptors in a 
system are contained in tables recognized by hardware. 


Terminology 


The following terms are used throughout the discussion 
of descriptors, privilege levels, and protection: 


PL: Privilege Levet—One of the four hierarchical privi- 
lege levels. Level 0 is the most privileged level and level 
3 is the least privileged. More privileged levels are 
numerically smaller than less privileged levels. 


RPL: Requester Privilege Level—The privilege level of 
the original supplier of the selector. RPL is determined 
by the feast two significant bits of a setector. 


DPL: Descriptor Privilege Levet—This is the least privi- 
leged level at which a task may access that descriptor 
(and the segment associated with that descriptor). Des- 
criptor Privilege Level is determined by bits 6-5 in the 
Access Right Byte of a descriptor. 


CPL: Current Privilege Level—The privilege level at 
which a task is currently executing, which equals the 
privilege level of the code segment being executed. CPL 
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can also be determined by examining the lowest 2 bits of 
the CS register, except for conforming code segments. 


EPL: Effective Privilege Level—The effective privilege 
level is the least privileged of the RPL and DPL. Since 
small privilege level values indicate greater privilege, 
EPL is the numerical maximum of RPL and DPL. 


Task: One instance of the execution of aprogram. Tasks 
are also referred to as processes. 


Descriptor Tables 


Descriptor Tables introduction 


The descriptor tables define all of the segments which 
are used in an Am386DE microprocessor system. 
There are three types of tables on the AM386DE micro- 
processor that hold descriptors: the Global Descriptor 
Table, Local Descriptor Table, and the interrupt Descrip- 
tor Table. All of the tables are variable length memory 
arrays. They can range in size between 8 bytes and 64 
Kbytes. Each table can hold up to 8192 eight byte des- 
criptors. The upper 13 bits of a selector are used as an 
index into the descriptor table. The tables have registers 
associated with them that hold the 32-bit base address, 
and the 16-bit limit of each table. 


Each of the tables has a register associated with it: the 
GDTR, LDTR, and the IDTR (see Figure 15). The LGDT, 
LLOT, and LIDT instructions load the base and limit of 
the Global, Local, and Interrupt Descriptor Tables, 
respectively, into the appropriate register. The SGDT, 
SLDT, and SIDT instructions store the base and limit val- 
ues. These tables are manipulated by the operating sys- 
tem. Therefore, the load descriptor table instructions are 
privileged instructions. 


LDTR 


15 ie} 
LDT DESCR 
Selector 
15 0 


IDT Limit 
IDT Base 
Physical Address 
1 0 


15 Q 
GODT Limit 
GDT Base 
Physical Address 
1 0 


\OTR 


3 


GDTR 


3 


LDT Limit 
Physical Address 
31 0 

Program Invisible 


Automatically Loaded 
From LOT Descriptor 
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Figure 15. Descriptor Table Registers 
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Global Descriptor Table 


The Global Descriptor Table (GDT) contains descriptors 
that are possibly available to all of the tasks in a system. 
The GDT can contain any type of segment descriptor 
except for descriptors that are used for servicing inter- 
rupts (i.e., interrupt and trap descriptors). Every 
Am386DE microprocessor contains a GDT. Generally, 
the GDT contains code and data segments used by the 
operating systems and task state segments and des- 
criptors for the LDTs in a system. 


The first slot of the Global Descriptor Table corresponds 
to the null selector and is not used. The null selector 
defines a null pointer value. 


Local Descriptor Table 


LDTs contain descriptors that are associated with a 
given task. Generally, operating systems are designed 
so that each task has a separate LDT. The LDT may con- 
tain only code, data, stack, task gate, and cali gate des- 
criptors. LDTs provide a mechanism for isolating a given 
task’s code and data segments fromthe rest of the oper- 
ating system, while the GDT contains descriptors for 
segments that are common to all tasks. A segment can- 
not be accessed by a task if its segment descriptor does 
not exist in either the current LDT or the GDT. This pro- 
vides both isolation and protection for a task’s seg- 
ments, while still allowing global data to be shared 
among tasks. 


IDT Limit fin 
IDT Base 


Gate For 
Interrupt #n 


Unlike the 6-byte GDT or IDT registers that contain a 
base address and limit, the visible portion of the LDT 
register contains only a 16-bit selector. This selector 
refers to a Local Descriptor Table descriptor in the GDT. 


interrupt Descriptor Table 


The third table needed for Am386DE microprocessor 
systems is the Interrupt Descriptor Table (see Figure 16). 
The IDT contains the descriptors that point to the loca- 
tion of up to 256 interrupt service routines. The DT may 
contain only task gates, interrupt gates, and trap gates. 
The IDT should be at least 256 bytes in size in order to 
hold the descriptors for the 32, Reserved for Future Use, 
interrupts. Every interrupt used by a system must have 
an entry in the IDT. The IDT entries are referenced via 
INT instructions, external interrupt vectors, and excep- 
tions. (See the Interrupts section on page 26.) 


Gate For 
Interrupt #n — 1 


Interrupt 
Descriptor 
Table 
(IDT) 
Gate For 
Interrupt #1 
Gate For ; 
Interrupt #0 Increasing 
Memory 
Addresses 
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Figure 16. Interrupt Descriptor Table Register Use 
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Descripior Attribute Bits 


The object to which the segment selector points is called 
a descriptor. Descriptors are 8-byte quantities that con- 
tain attributes about a given region of physical address 
space (i.e., a segment). These attributes include the 
32-bit base physical address of the segment, the 20-bit 
length and granularity of the segment, the protection 
level, read, write, or execute privileges, the default size 
of the operands (16 bit or 32 bit), and the type of seg- 
ment. All of the attribute information about a segment is 
contained in 12 bits inthe segment descriptor. Figure 17 
shows the general format of a descriptor. All segments 
on the Am386DE microprocessor have three attribute 
fields in common: the P bit, the DPL bit, and the S bit. 
The Present P bitis 1 if the segment is loaded in physical 
memory; if P = 0 then any attempt to access this seg- 
ment causes a not present exception (Exception 11). 
The Descriptor Privilege Level (DPL) is a 2-bit field 
that specifies the protection levels 0-3 associated 
with a segment. 


The Am386DE microprocessor has two main categories 
of segments: system segments and non-system seg- 
ments (for code and data). The segment S bit inthe seg- 
ment descriptor determines if a given segment is a 
system segment or acode or data segment. If the S bit is 
1, then the segment is either a code or data segment: if it 
is 0, then the segment is a system segment. 
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Am386DE Microprocessor Code and Data 
Descriptors (S = 1) 

Figure 18 shows the general format of a code and data 
descriptor and Table 12 illustrates how the bits in the 
Access Rights Byte are interpreted. 


Code and data segments have several descriptor fields 
in common. The accessed A bit is set whenever the pro- 
cessor accesses a descriptor. The A bit is used by oper- 
ating systems to keep usage statistics on a given 
segment. The G bit, or granularity bit, specifies if a seg- 
ment length is byte-granular or page-granular. 
Am386DE microprocessor segments can be 1 Mbyte 
long with byte granularity (G = 0) or 4 Gbytes with page 
granularity (G = 1), (i.e., 22° pages—each page is 4 
Kbytes in length). An Am386DE CPU system can con- 
sist of segments with byte granularity and page 
granularity. 


The executable E bit tells if a segment is a code or data 
segment. A code segment (E=1, S=1) may he 
execute-only or execute/read as determined by the 
Read R bit. Code segments are execute only if R = 0 
and execute/read if R =1. Code segments may never be 
written into. 


Note: Code segments may be modified via aliases. 
Aliases are writeable data segments that occupy the 
same range of physical address space as the code seg- 
ment. 


The D bit indicates the default length for operands and 


effective addresses. lf D = 1, then 32-bit operands and 
32-bit addressing modes are assumed. If D = 0, then 
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Byte 
0 Address 


3 DPL Type 
| Limit Base 
j_sewaret falafel} ct [el Y'}s] Yh fe] se} « 
Base Sase Address of the segment 
Limit The length of the segment 
P Present Bit: 1 = Present, 0 = Not Present 
DPL Descriptor Privilege Levels 0-3 
Ss Segment Descriptor: 0 = System Descriptor, 1 = Code or Data Segment Descriptor 
Type Type of Segment 
A Accessed Bit 
G Granularity Bit: 1 = Segment length is page granular; 0 = Segment length is byte granular 
D Default Operation Size (recognized in code segment descriptors only): 1 = 32-bit segment, 0 = 16-bit segment 


Bit must be zero (0) for compatibility with future processors 


0 
AVL Available field for user or OS 


Note: 


in a maximum-size segment (i.e., segment with G = 1 and segment limit 19~0 = FFFFFH), the lowest 12 bits of the 
sagment base should be zero (i.e., segment base 11-000 = 000H). 


Figure 17. General Format of Segment Descriptors 
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Segment Limit 15-0 0 


Base 
23-16 


D/B 1 = Default Instructions Attributes are 32 bits 
0 = Defaut Instructions Attributes are 16 bits 
AVL Available fiald for user or OS 
G Granularity Bit: 1 = Segment length is page granular; 0 = Segment length is byte granular 
0 Bit must be zero (0) for compatibility with future processors 
Note: 


In a maximum-size segment (i.e., a segment with G = 1 and segment limit 19-0 = FFFFFH), the lowest 12 bits of 
the segment base should be zero (i.e., segment base 11-000 = 000H). 
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Figure 18. Code and Data Segment Descriptors 


Table 12. Access Rights Byte Definition for Code and Data Descriptions 


Bit 
Position Name Function 


Present (P) 


Descriptor Privilege 
Levels (DPL) 
Segment Descriptor (S) 


Executable (E) 
Expansion Direction (ED) 


OOn 


Writeable (W) 


Executable (E) 
Conforming (C) 


Readable (RA) 


>>| Dp Om! ssmmm 


Accessed (A) 


7 
6-5 
4 
3 
2 
1 


16-bit operands and 16-bit addressing modes are 
assumed. Therefore all existing 80286 code segments 
willexecute on the Am386DE microprocessor assuming 
the D bit is set 0. 


Another attribute of code segments is determined by the 
contorming C bit. Conforming segments, C = 1, can be 
executed and shared by programs at ditterent privilege 
levels (see the Protection section on page 45). 


Segments identified as data segments (E = 0, S = 1) 
are used for two types of Am386DE microprocessor 
segments: stack and data segments. The expansion 
direction (ED) bit specities it a segment expands down- 
ward (stack) or upward (data). If a segment is a stack 
segment, all offsets must be greater than the segment 


Segment is mapped into physical memory. 

No mapping to physical memory exists, base and limit are 
not used. 

Segment privilege attribute used in privilege tests. 


Code or Data (includes stacks) segment descriptor. 
System Segment Descriptor or Gate Descriptor. 


Descriptor type is data segment. 

Expand up segment, offsets must be < limit. 
Expand down segment, offsets must be > limit. 
Data segment may not be written into. 

Data segment may be written into. 


i Data 
Segment 
(S=1, 
E =0) 


Descriptor type is code segment. 

Code segment may only be executed when 
CPL 2 DPL and CPL remains unchanged. 
Code segment may not be read. 

Code segment may be read. 


If Code 
Segment 
(S=1, 
E=1) 
Segment has not been accessed. 


Segment selector has been loaded into sagment register 
or used by selector test instructions. 


limit. On a data segment all offsets must be less than or 
equal to the limit. In other words, stack segments start at 
the base physical address plus the maximum segment 
limit and grow down to the base physical address plus 
the limit. On the other hand, data segments start at the 
base physical address and expand to the base physical 
address plus limit. 


The write (W) bit controls the ability to write into a seg- 
ment. Data segments are read-only if W = 0. The stack 
segment must have W = 1. 


The B bit controls the size of the stack pointer register. If 
B= 1, then PUSHes, POPs, and CALLs all use the 
32-bit ESP register for stack references and assume an 
upper limit of FFFFFFFFH. If B = 0, stack instructions 
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alluse the 16-bit SP register and assume an upper limit 
of FFFFH. 


System Descriptor Formats 


System segments describe information about operating 
system tables, tasks, and gates. Figure 19 shows the 
general format of system segment descriptors, and the 
various types of system segments. The Am386DE 
microprocessor system descriptors contain a 32-bit 
base physical address and a 20-bit segment limit. 80286 
system descriptors have a 24-bit base address and a 
16-bit segment limit. 80286 system descriptors are iden- 
tified by the upper 16 bits being ail zeros. 


LDT Descriptors (S = 0, Type = 2) 


LOT descriptors (S = 0, TYPE = 2) contain information 
about Local Descriptor Tables. LDTs contain a table of 
segment descriptors, unique to a particular task. Since 
the instruction to load the LOTR is only available at privi- 
lege level 0, the DPL field is ignored. LDT descriptors 
are only allowed in the Global Descriptor Table (GDT). 


TSS Descriptors (S = 0, Type = 1, 3, 9, B) 

A Task State Segment (TSS) descriptor contains 
information about the location, size, and privilege level 
of a TSS. ATSS inturn is a special tixed format segment 
that contains all the state information for a task and a 
linkage field to permit nesting tasks. The Type field is 
used to indicate whether the task is currently BUSY (i.e., 
on a chain of active tasks) or the TSS is available. The 
Type field also indicates if the segment contains a 80286 
oran Am386DE microprocessor TSS. The Task Register 
(TR) contains the selector that points to the current TSS. 


Gate Descriptors (S = 0, Type = 4-7, C, F) 


Gates are used to control access to entry points within 
the target code segment. The various types of gate 
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Segment Base 15~0 


Type Definition 


Invalid 

Available 80286 TSS 

LOT 

Busy 80286 TSS 

80286 Call Gate 

Task Gate (for 80286 or Am386DE CPU Task) 
80286 Interrupt Gate 

80286 Trap Gate 


NOMPOM-O 


Note: 


descriptors are call gates, task gates, interrupt gates, 
and trap gates. Gates provide a level of indirection 
between the source and destination of the control trans- 
fer. This indirection allows the processor to automati- 
cally perform protection checks. It also allows system 
designers to control entry points to the operating sys- 
tem. Call gates are used to change privilege levels (see 
the Protection section on page 45), task gates are used 
to perform atask switch, and interrupt and trap gates are 
used to specify interrupt service routines. 


Figure 20 shows the format of the four types of gate des- 
criptors. Call gates are primarily used to transfer pro- 
gram control to a more privileged level. The call gate 
descriptor consists of three fields: the access byte; a 
long pointer (selector and offset) that points to the start 
of a routine; and a word count that specifies how many 
parameters are to be copied from the caller's stack to 
the stack of the called routine. The word count field is 
only used by call gates when there is a change in the 
privilege level, other types of gates ignore the word 
count field. 


Interrupt and trap gates use the destination selector and 
destination offset fields of the gate descriptor as a 
pointer to the start of the interrupt or trap handler rou- 
tines. The difference between interrupt gates and trap 
gates is that the interrupt gate disables interrupts (resets 
the IF bit) while the trap gate does not. 


Task gates are used to switch tasks. Task gates may 
only refer to a task state segment (see the Task Switch- 
ing section on page 49); therefore, only the destination 
selector portion of a task gate descriptor is used, andthe 
destination offset is ignored. 


Exception 13 is generated when a destination selector 
does not refer to a correct descriptor type (i.e., a code 


16 0 
Segment Limit 15-0 0 


Definition 


Limit DPL Type Base 
coe PEL TEL el 
pe 


Ty 


Invalid 

Available Am386DE CPU TSS 
Undefined (Reserved) 

Busy Am386DE CPU TSS 
Am386DE CPU Call Gate 
Undefined (Reserved) 
Am386DE CPU Interrupt Gate 
Am386DE CPU Trap Gate 


TMMOVODPYoCoH 


in a maximum-size segment (i.e., segment with G = 1 and segment limit 19-0 = FFFFFH), the lowest 12 bits 
of the segment base should be zero (i.e., segment base 11-000 = 000H). 


Figure 19. System Segments Descriptors 
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Name Value 
Type 4 


ODPL Type Word 
Offset 31-16 Count 
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Offset 15-0 


Description 
80286 Cail Gate 


Task Gate (for 80286 or Am386DE CPU Task) 
80286 Interrupt Gate 

80286 Trap Gate 

Am386DE CPU Cali Gate 


Am386DE CPU Trap Gate 


6 

7 

Cc 

E Am386DE CPU Interrupt Gate 

F 

0 Descriptor contents are not valid 
1 


Descriptor contents are valid 


DPL—Least privileged level at which a task may access the gate. WORD COUNT 0-31—The number of parameters to 
copy from caller's stack to the called procedure’s stack. The parameters are 32-bit quantities for AM386DE CPU gates, and 


16-bit quantities for 80286 gates. 


DESTINATION 16-Bit 
SELECTOR Selector 
DESTINATION Offset 
OFFSET 16-bit 80286 


32-bit Am386DE CPU 


Figure 20. Gate Descriptor Formats 


Selector to the target code segment 
or 
Selector to the target state segment for task gate 


Entry point within the target code segment 
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segment for an interrupt, trap, or call gate, a TSS fora 
task gate). 


The access byte format is the same for all gate descrip- 
tors. P = 1 indicates that the gate contents are valid. 
P = 0 indicates the contents are not valid and causes 
Exception 11 if referenced. DPL is the descriptor privi- 
fege level and specifies when this descriptor may be 
used by a task (see the Protection section on page 45). 
The §S field, bit 4 of the access rights byte, must be 0 to 
indicate a system control descriptor. The type field spec- 
ities the descriptor type as indicated in Figure 20. 


Difference Between Am386DE Microprocessor 
and 80286 Descriptors 


In order to provide operating system compatibility 
between the 80286 and AmM386DE microprocessor, the 
Am386DE CPU supports all of the 80286 segment des- 
criptors. Figure 21 shows the general format of an 
80286 system segment descriptor. The only differences 
between 80286 and Am386DE device descriptor for- 
mats are that the values of the type fields and the limit 
and base address fields have been expanded for the 
Am386DE device. The 80286 system segment descrip- 
tors contained a 24-bit base address and 16-bit limit, 
while the Am386DE microprocessor system segment 


descriptors have a 32-bit base address, a 20-bit limit 
field, and a granularity bit. 


By supporting 80286 system segments, the AM386DE 
microprocessor is able to execute 80286 application 
programs on an Am386DE CPU operating system. This 
is possible because the processor automatically under- 
stands which descriptors are 80286-style descriptors 
and which are Am386DE microprocessor-style descrip- 
tors. In particular, if the upper word of a descriptor is 
zero, then that descriptor is an 80286-style descriptor. 


The only other differences between 80286-style des- 
criptors and Am386DE microprocessor descriptors is 
the interpretation of the word count field of call gates and 
the B bit. The word count field specifies the number of 
16-bit quantities to copy for 80286 call gates and 
32-bit quantities for AM386DE device call gates. The 
B bit controls the size of PUSHes when using a call 
gate; if B = 0, then PUSHes are 16 bits, if B = 1, then 
PUSHes are 32 bits. 


Selector Flelds 


A selector in Protected Mode has three fields: Local or 
Global Descriptor Table Indicator (TI), Descriptor Entry 
Index (Index), and Requestor (the selector’s) Privilege 
Level (RPL) as shown in Figure 22. The TI bits select 
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Present Bit: 1 = Present, 0 = Not Present 
Descriptor Privilege Levels 0-3 
System Descriptor: 0 = System, 1 = User 
Type of Segment 

Figure 21. 80286 Code and Data Segment Descriptors 18421A-023 


one of two memory-based tables of descriptors (the Segment Descriptor Cache 


Global Descriptor Table or the Local Descriptor Table). 
The Index selects one of 8K descriptors in the appropri- 
ate descriptor table. The RPL bits allow high speed test- 
ing of the selector’s privilege attributes. 


\n addition to the selector value, every segment register 
has a segment descriptor cache register associated 
with it. Whenever a segment register's contents are 
changed, the 8-byte descriptor associated with that 


selector is automatically loaded (cached) on the chip. 
Once loaded, all references to that segment use the 
cached descriptor information instead of reaccessing 


Selector 


15 4321 0 


Sagment 
Register 


Descriptor 
Number 


0 
Local Descriptor Table Global Descriptor Table 
Figure 22. Example Descriptor Selection 18421A-024 
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the descriptor. The contents of the descriptor cache are 
not visible to the programmer. Since descriptor caches 
only change when a segment register is changed, pro- 
grams that modify the descriptor tables must reload the 
appropriate segment registers after changing a descrip- 
tor's value. 


Segment Descriptor Register Settings 


The contents of the segment descriptor cache vary 
depending on the operating mode of the Am386DE 
microprocessor. When operating in Real Address 
Mode, the segment base, limit, and other attributes 
within the segment cache registers are defined as 
shown in Figure 23. 


For compatibility with the 8086 architecture, the base is 
set to 16 times the current selector value, the limit is 
fixed at OOOOFFFFH, and the attributes are fixed to indi- 
cate that the segment is present and fully usable. In Real 
Address Mode, the internal privilege level is always 
fixed to the highest level, level 0, so I/O and other privi- 
leged op-codes may be executed. 


When operating in Protected Mode, the segment base, 
limit, and other attributes within the segment cache reg- 
isters are defined as shown in Figure 24. In Protected 
Mode, each of these fields are detined according to the 
contents of the segment descriptor indexed by the 
selector value loaded into the segment register. 


Segment Descriptor Cache Register Contents 


32-Bit Base 
(Updated During Selector 
Load into Segment Register) 


32-Bit Limit 
(Fixed) 
Conforming Privilege 
Stack Size 
Executable 
Writeable 
Readable 
Expansion Direction 
Granularity 
Accessed 
Privilege Level 


Other Attributes 
(Fixed) 


Present 
BASE LIMIT i 


a 


CS | 16X Current CS Selector’ OOOOFFFFH 


Y] 0 
Froxcurent Ss Secor | _oooorrrre | vo) 
[rex Curent 05 Somer | oooorrern |v 2. 
Prox curent €S Seocor | _oooorrerH | ¥ [0 
mn 

0 


16X Current FS Selector OOOOFFFFH 


Key: Y = Yes D = Expand down 
N = No B = Byte granularity 
0 = Privilege level 0 P = Page granularity 
1 = Privilege level 1 W = Push/pop 16-bit words 
2 = Privilege level 2 F = Push/pop 32-bit Dwords 
3 = Privilege level 3 — = Does not apply to that segment cache register 
U = Expand up 
Note: 


*Except the 32-bit CS base is initialized to FFFFFOOOH after reset until first intersegment control transfer 
(e.g., intersegment CALL, or intersegment JMP, or INT). 


Figure 23. Segment Descriptor Caches for Real Address Mode 18421A-025 
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Segment Descriptor Cache Register Contents 


32-Bit Base 
(Updated During Selector 
Load into Segment Register) 


Contorming Privilege 
Stack Size 
Executable 
Writeable 
Readable 
Expansion Direction 
Granularity 
Accessed 
Privilage Level 
Present 


BASE LIMIT 


CS | Base per Seg Descr 
ss 
DS 
ES 
FS 
GS | Base per Seg Descr 


32-Bit Limit 
(Updated During Selector 
Load Into Segment Register) 


Limit per Seg Descr 


Limit per Seg Descr 


Other Attributes 
(Updated During Selector 
Load Into Segment Register} 


Per segment descriptor, but descriptor must indicate “readable” to avoid Exception 13 (special case for SS) 


= Per segment descriptor; descriptor must indicate “present” to avoid Exception 11 (Exception 12 in case of SS) 
= Per segment descriptor, but descriptor must indicate “writeable” to avoid Exception 13 (special case for SS) 
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Figure 24. Segment Descriptor Caches for Protected Mode (Loaded per Descriptor) 


Key: Y = Fixed Yes 
N = Fixed No 
d = Per segment descriptor 
p 
r 
w 
- = Does not apply to that segment cache register 
Protection 


Protection Concepts 


The Am386DE microprocessor has four levels of 
protection that are optimized to support the needs of a 
multitasking operating system to isolate and protect 
user programs from each other and the operating sys- 
tem. The privilege levels control the use of privileged 
instructions, I/O instructions, and access to segments 
and segment descriptors. 


The four-level hierarchical privilege system is illustrated 
in Figure 25. lt is an extension of the user/supervisor 
privilege mode commonly used by minicomputers. The 
privilege levels (PL) are numbered 0 through 3. Level 0 
is the most privileged or trusted level. 


Rules of Privilege 


The Am386DE microprocessor controls access to both 
data and procedures between levels of a task, accord- 
ing to the following rules. 


* Data stored inasegment with privilege level p canbe 
accessed only by code executing at a privilege level 
at least as privileged as p. 

« Acode segment/procedure with privilege level p can 
only be called by a task executing at the same or a 
lesser privilege level than p. 


Privilege Levels 
Task Privilege 


At any point in time, a task on the Am386DE micrapro- 
cessor always executes at one of the four privilege lev- 
els. The Current Privilege Level (CPL) specifies the 
task's privilege level. Atask’s CPL may only be changed 
by control transfers through gate descriptors to a code 
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segment with a different privilege level (see the Privilege 
Level Transfers section on page 49). Thus, an applica- 
tion program running at PL = 3 may call an operating sys- 
tem routine at PL = 1 (via a gate) that would cause the 
task’s CPL to be set to 1 until operating system routine is 
finished. 


Selector Privilege (RPL) 


The privilege level of a selector is specified by the RPL 
field. The RPL is the two least significant bits of the 
selector. The selector’s RPL is only used to establish a 
less trusted privilege level than the current privilege 
level for the use of a segment. This level is called the 
task’s effective privilege level (EPL). The EPL is defined 
as being the least privileged (i.e., numerically larger) 
levelof atask’s CPL and a selector's RPL. Thus, if selec- 
tor's RPL = 0, then the CPL always specifies the privi- 
lege level for making an access using the selector. On 
the other hand if RPL = 3, then a selector can only 
access segments at level 3 regardless of the task's CPL. 
The RPL is most commonly used to verify that pointers 
passed to an operating system procedure do not access 
data that is of higher privilege than the procedure that 
originated the pointer. Since the originator of a selector 
can specify any RPL value, the Adjust RPL (ARPL) 
instruction is provided to force the RPL bits to the origi- 
nator’s CPL. 


V/O Privilege and I/O Permission Bitmap 


The I/O privilege level (IOPL, a 2-bit fieldin the EFLAGS 
register) defines the least privileged level at which I/O 
instructions can be unconditionally performed. VO 
instructions can be unconditionally performed when 
CPL < IOPL. (The I/O instructions are IN, OUT, INS, 
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Figure 25. Four-Level Hierarchical Protection 
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OUTS, REP INS, and REP OUTS.) When CPL > IOPL, 
and the current task is associated with a 286 TSS, 
attempted I/O instructions cause an Exception 13 fault. 
When CPL > IOPL, and the current task is associated 
with an Am386DE CPU TSS, the I/O Permission Bitmap 
(part of an AM386DE microprocessor TSS) is consulted 
on whether I/O to the port is allowed, or an Exception 13 
fauit is to be generated instead. For diagrams of the I/O 
Permission Bitmap, refer to Figure 26. For further 
information on how the I/O Permission Bitmap is used in 
Protected Mode, refer to the Protection section on page 
45 and I/O Permission Bitmap section on page 46. 


The I/O privilege level (1OPL) also affects whether sev- 
eral other instructions can be executed or cause an 
Exception 13 fault instead. These instructions are called 
1OPL-sensitive instructions and they are CLI and STI. 
(Note that the LOCK prefix is not 1OPL-sensitive on the 
Am386DE microprocessor.) 


The IOPL also affects whether the IF bit (interrupts 
enable flag) can be changed by loading a value into the 
EFLAGS register. When CPL < IOPL, the IF bit can be 
changed by loading a new value into the EFLAGS regis- 
ter. When CPL > IOPL, the IF bit cannot be changed by a 
new value POP’ed into (or otherwise loaded into) the 
EFLAGS register; the IF bit merely remains unchanged 
and no exception is generated. 


Privilege Validation 


The Am386DE CPU provides several instructions to 
speed pointer testing and help maintain system integrity 
by verifying that the selector value refers to an appropriate 
segment. Table 13 summarizes the selector validation 
procedures available for the Am386DE microprocessor. 


This pointer verification prevents the common problem of 
an application at PL = 3 calling an operating-systems rou- 
tine at PL = 0 and passing the operating-systems routine 
a bad pointer that corrupts a data structure belonging to 
the operating system. If the operating-systems routine 
uses the ARPL instruction to ensure that the RPL of the 
selector has no greater privilege than that of the caller, 
then this problem can be avoided. 


Descriptor Access 


There are basically two types of segment accesses: 
those involving code segments, such as control trans- 
fers; and those involving data accesses. Determining 
the ability of a task to access a segment involves the 
type of segment to be accessed, the instruction used, 
the type of descriptor used, and CPL, RPL, and DPL as 
described above. 


Any time an instruction loads data segment registers 
(DS, ES, FS, GS) the Am386DE microprocessor makes 
protection validation checks. Selectors loaded in the 
DS, ES, FS, GS registers must refer only to data seg- 
ments or readable code segments. The data access 
rules are specified in the Rules of Privilege section. The 
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only exception to those rules is readable contorming code 
segments which can be accessed at any privilege level. 


Finally, the privilege validation checks are performed. 
The CPL is compared to the EPL; if the EPL is more priv- 
ileged than the CPL, an Exception 13 (General Protec- 
tion fault) is generated. 


The rules regarding the stack segment are slightly dif- 
ferent than those involving data segments. Instruc- 
tions that load selectors into SS must refer to data 
segment descriptors for writeable data segments. The 
DPL and RPL must equal the CPL. All other descriptor 
types or aprivilege level violation will cause Exception 
13. A stack not present fault causes Exception 12. 
Note that an Exception 11 is used-for a not-present 
code or data segment. 


Privilege Level Transfers 


Intersegment control transfers occur when a selector is 
loaded in the CS register. For a typical system most of 


Table 13. Pointer Test Instructions 


Instruction Operands Function 


ARPL Selector, Adjust Requested Privilege 
Register Level; adjusts the APL of the 
selector to the numeric maximum 
of current selector RPL value and 
the RPL value in the register. Set 
zero flag if selector RPL was 
changed. 


VERity for Read: sets the zero 
flag # the segment referred to 
by the selector can be read. 


VERify for Write: sets the zero 
flag if the segment referred to 
by the selector can be written. 


VERR Selector 


VERW 


Selector 


Register, 
Selector 


Load Segment Limit: reads the 
segment limit into the register if 
privilege rules and descriptor 
type allow. Set zero flag if 
successful. 


Register, 
Selector 


Load Access Rights: reads the 
descriptor access rights byte into 
the register if privilege rules allow. 
Set zero flag if successful. 


18421A-029 


these transfers are simply the result of a call or a jump to 
another routine. There are five types of control transfers, 
which are summarized in Table 14. 


Many of these transfers result in a privilege level trans- 
fer. Changing privilege levels is done only via control 
transfers by using gates, task switches, and interrupt or 
trap gates. 


Control transfers can only occur if the operation that 
loaded the selector references the correct descriptor 
type. Any violation of these descriptor usage rules will 
cause an Exception 13 (e.g., JMP through a call gate or 
IRET from a normal subroutine call). 


In order to provide further system security, all control 
transfers are also subject to the privilege rules. 


The privilege rules require that: 


1. Privilege level transitions can only occur via gates. 


2. JMPs can be made to a non-conforming code 
segment with the same privilege or to a conforming 
code segment with greater or equal privilege. 


3. CALLs can be made to a non-conforming code 
segment with the same privilege or via a gate to a 
more privileged level. 


4. Interrupts handled within the task obey the same 
privilege rules as CALLs. 


5. Conforming Code segments are accessible by 
privilege levels that are the same or less privileged 
than the conforming-code segment's DPL. 


6. Both the requested privilege level (RPL) in the 
selector pointing to the gate and the task's CPL must 
be of equal or greater privilege than the gate’s DPL. 


7. The code segment selected in the gate must be the 
same or more privileged than the task’s CPL. 


8. Return instructions that do not switch tasks can only 
return control to a code seqment with same or less 
privilege. 

9. Task switches can be performed by a CALL, JMP, or 
INT that references either a task gate or task state 
segment whose DPL is less privileged or the same 
privilege as the old task's CPL. 
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Any control transfer that changes CPL within a task 
causes a change of stacks as a result of the privilege 
level change. The initial values of SS:ESP for privilege 
levels 0, 1, and 2 are retained in the task state segment 
(see the Task Switching section). During a JMP or CALL 
control transfer, the new stack pointer is loaded inthe SS 
and ESP registers and the previous stack pointer is 
pushed onto the new stack. 


When returning to the original privilege level, use of the 
lower-privilege stack is restored as part of the RET or 
IRET instruction operation. For subroutine calls that 
pass parameters onthe stack and cross privilege levels, 
a fixed number of words (as specified in the gate's word 
count field) are copied from the previous stack to the cur- 
rent stack. The interseqment RET instruction with a 
stack adjustment value will correctly restore the pre- 
vious stack pointer upon return. 


Cail Gates 


Gates provide protected, indirect CALLs. One of the 
major uses of gates is to provide a secure method of 
privilege transfers within a task. Since the operating sys- 
tem defines all the gates in a system, it can ensure that 
all gates only allow entry into a few trusted procedure s 
(such as those that allocate memory or perform I/O) 


Gate descriptors follow the data access rules of 5 rivi- 
lege; that is, gates can be accessed by a task if the EPL 
is equal to or more privileged than the gate descri-tor's 
DPL. Gates follow the control transfer rules of privilege 


Pre elie may only transfer control to a more privi- 
legee 


evel. 


Cail gates .ve accessed via a CALL instruction: and are 
syntactically ‘dentical to calling a normal subroutine. 
When an interl’vel AM386DE microprocessor call gate 
is activated, the iallowing actions occur: 


AMD cl 


Load CS:EIP from gate check for validity; 
SS is pushed zero-extended to 32 bits; 
ESP is pushed; 


Copy word count 32-bit parameters from the old 
stack to the new stack; 


5. Push return address on stack. 


PONS 


The procedure is identical for 80286 Call gates, except 
that 16-bit parameters are copied and 16-bit registers 
are pushed. 


Interrupt gates and Trap gates work in a similar fashion 
as the call gates, except there is no copying of parame- 
ters. The only difference between Trap and Interrupt 
gates is that control transfers through an Interrupt gate 
disable further interrupts (i.e., the IF bit is set to 0) and 
Trap gates leave the interrupt status unchanged. 


Task Switching 


Avery important attribute of any multitasking/multi-user 
operating system is its ability to rapidly switch between 
tasks or processes. The Am386DE microprocessor 
directly supports this operation by providing a task 
switch instruction in hardware. The AM386DE CPU task 
switch operation saves the entire state of the machine 
(all of the registers, address space, and a link to the pre- 
vious task), loads a new execution state, performs 
protection checks, and commences execution in the 
new task, in about 17 ms. Like transfer of control via 
gates, the task switch operation is invoked by executing 
anintersegment JMP or CALL instruction that refers to a 
Task State Segment (TSS), or a task gate descriptor in 
the GDT or LOT. An INT n instruction, exception, trap, or 
external interrupt may also invoke the task switch opera- 
tion if there is a task gate descriptor in the associated 
IDT descriptor slot. 


Table 14. Cescriptor Types Used for Control Transfer 


Control Transfer Types 


Intersegment within the sa ne privilege level 


Intersegment to the same _r higher privile Je level 


Interrupt within task may chane CPL 


Descriptor Descriptor 
Operation Types Referenced Table 
JMP, CALL, RET, IRET* Code Segment GOT/LDT 
CALL Call Gate GDT/LDOT 


Intersegment to a lower privilege leve ‘ ‘change task CPL) | RET, IRET* Code Segment GDT/LDT 
CALL, JMP Task State Segment 
CALL, JMP GDT/LDT 


IRET**, Interrupt Instruction, 
ae silt mene 


Task Switch 


Note: 
*NT (Nested Task bit of flag register) = 0 
*NT (Nested Task bit of flag register) = 1 


Interrupt Instruction, Trap or Interrupt 
Exception, External Interrupt] Gate IDT 
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The TSS descriptor points to a segment (see 


Figure 26a) containing the entire AM386DE micropro- 
cessor execution state while a task gate descriptor con- 
tains a TSS selector. The AM386DE CPU supports both 
80286 and Am386DE CPU style TSSs. Figure 27 
shows an 80286 TSS. The limit of an AM386DE micro- 
processor TSS must be greater than 0064H (002BH for 
an 80286 TSS) and can be as large as 4 Gbytes. In the 
additional TSS space, the operating system is free to 
store additional information, such as the reason the task 
is inactive, time the task has spent running, and open 
files belonging to the task. 


Each task must have a TSS associated with it. The cur- 
rent TSS is identified by a special register in the 
Am386DE microprocessor called the Task State Seg- 
ment Register (TR). This register contains a selector 
referring to the task state segment descriptor that 
defines the current TSS. A hidden base and limit register 
associated with TR are loaded whenever TR is loaded 
with a new selector. Returning from a task is accom- 
plished by the RET instruction. When |RET is executed, 
control is returned to the task that was interrupted. The 
current executing task’s state is saved in the TSS and 
the old task state is restored from its TSS. 
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Figure 27. 80286 TSS 


Several bits in the flag register and machine status word 
(CRO) give information about the state of a task that is 
useful to the operating system. The Nested Task (NT) 
(bit 14 in EFLAGS) controls the function of the IRET 
instruction. if NT = 0, the IRET instruction performs the 
regular return; when NT = 1, IRET performs a task 
switch operation back to the previous task. The NT bitis 
set or reset in the following fashion. 


When a CALL or INT instruction initiates a task switch, 
the new TSS will be marked busy and the back link field 
of the new TSS setto the old TSS selector. The NT bit of 
the new task is set by CALL or INT initiated task 
switches. An interrupt that does not cause a task switch 
will clear NT. (The NT bit will be restored after execution 
of the interrupt handler.) NT may also be set or cleared 
by POPF or IRET instructions. 


The Am386DE microprocessor Task State Segment is 
marked busy by changing the descriptor type field from 
Type 9H to Type BH. An 80286 TSS is marked busy by 
changing the descriptor type field from Type 1 to Type 3. 
Use of a selector that references a busy task state seg- 
ment causes an Exception 13. 


The coprocessor’s state is not automatically saved 
when a task switch occurs, because the incoming task 
may not use the coprocessor. The Task Switched (TS) 
Bit (bit 3 in the CRO) helps deal with the coprocessor's 
state in a multitasking environment. Whenever the 
Am386DE microprocessor switches tasks, it sets the TS 
bit. The Am386DE CPU detects the first use of a proces- 
sor extension instruction after a task switch and causes 
the processor extension not available Exception 7. The 
exception handler for Exception 7 may then decide 
whether to save the state of the coprocessor. A proces- 
sor extension not present Exception 7 will occur when 
attempting to execute an ESC or WAIT instruction if the 
Task Switched and Monitor coprocessor extension bits 
are both set (i.e., TS = 1 and MP = 1). 


The T bit in the AM386DE microprocessor TSS indicates 
that the processor should generate a debug exception 
when switching to a task. If T = 1, then upon entry to a 
new task, a debug Exception 1 will be generated. 


InRtialization and Transition to Protected Mode 
Since the AmM386DE microprocessor begins executing 
in Real Mode immediately after RESET, it is necessary 
to initialize the system tables and registers with the 
appropriate values. 


The GDT and IDT registers must refer to a valid GOT 
and IDT. The IDT should be at least 256 bytes long, and 
GDT must contain descriptors for the initial code and 
data segments. Figure 28 shows the tables and 
Figure 29 shows the descriptors needed for a simple 
Protected Mode Am386DE microprocessor system. It 
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has a single code and single data/stack segment each 4 
Gbytes long and a single privilege level PL = 0. 


The actual method of enabling Protected Mode is to load 
CRO with PE bit set, via the MOV CRO, R/M instruction. 
This puts the AmM386DE microprocessor in Protected 
Mode. 


After enabling Protected Mode, the next instruction 
should execute anintersegment JMP to load the CS reg- 
ister and flush the instruction decode queue. The final 
Step is to load all of the data segment registers with the 
initial selector values. 


a \ 


Base Address 


Ss 


GS 


FS 


ES 


DS 


cs 


GDTR 


IDTR F] Limit 


0 
00000000 


Base Address 


Code Descriptor 


An alternate approach to entering Protected Mode that 
is especially appropriate for multitasking operating sys- 
tems is to use the built-in task-switch to load all of the 
registers. In this case, the GDT would contain two TSS 
descriptors in addition to the code and data descriptors 
needed for the first task. The first JMP instruction in Pro- 
tected Mode would jump to the TSS causing a task 
switch and loading all of the registers with the values 
stored inthe TSS. The TR should be initialized to point to 
a valid TSS descriptor since a task switch saves the 
State of the current task in a task state segment. 
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Figure 28. Simple Protected System 
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Figure 29. GDT Descriptors for Simple System 
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FUNCTIONAL DATA 
Introduction 


The AmM386DE microprocessor features a straight for- 
ward functional interface to the external hardware. The 
Am386DE CPU has separate parallel buses for data 
and address. The data bus is 32 bits in width and is 
bidirectional. The address bus outputs 32-bit address 
values in the most directly usable form for the high- 
speed local bus: 4 individual Byte Enable signals and 
the 30 upper-order bits as a binary value. The data and 
address buses are interpreted and controlled with their 
associated control signals. 


A dynamic data bus sizing feature allows the processor 
to handle a mix of 32- and 16-bit external buses on a 
cycle-by-cycle basis (see Dynamic Data Bus Sizing on 
page 59). If 16-bit bus size is selected, the Am386DE 
microprocessor automatically makes any adjustment 
needed, even performing another 16-bit bus cycle to 
complete the transfer if that is necessary. Any 8-bit 
peripheral devices may be connected to 32- or 16-bit 
buses with no loss of performance. Anew address pipe- 
lining option is provided and applies to 32- and 16-bit 
buses for substantially improved memory utilization, 
especially forthe most heavily used memory resources. 


The address pipelining option, when selected, typically 
allows a given memory interface to operate with one less 
wait state than would otherwise be required (see 
Address Pipelining on page 63). The pipelined bus is 
also well suited to interleaved memory designs. When 
address pipelining is requested by the external hard- 
ware, the AmM386DE microprocessor will output the 
address and bus cycle definition of the next bus cycle (if 
it is internally available) even while waiting for the cur- 
rent cycle to be acknowledged. 


Non-pipelined address timing, however, is ideal for 
external cache designs, since the cache memory will 
typically be fast enough to allow non-pipelined cycles. 
For maximum design flexibility, the address pipelining 
option is selectable on a cycle-by-cycle basis. 


The processor's bus cycle is the basic mechanism for 
information transfer, either from system to processor or 
from processor to system. AmM386DE microprocessor 
bus cycles perform data transfer in a minimum of only 
two clock periods. On a 32-bit data bus, the maximum 
Am386DE device transfer, at 33-MHz bandwidth is 66 
Mbytes/second. Any bus cycle will be extended for more 
than two clock periods, however, if external hardware 
withholds acknowledgment of the cycle. At the appropri- 
ate time, acknowledgment is signaled by asserting the 
Am386DE microprocessor READY input. 


The Am386DE CPU can relinquish control of its local 
buses to allow mastership by other devices, such as 
direct memory access channels. When relinquished, 
HLDA is the only output pin driven by the AmM386DE 


microprocessor providing near-complete isolation of the 
processor from its system. The near-complete isolation 
characteristic is ideal when driving the system trom test 
equipment and in fault-tolerant applications. 


Functional data covered in this section describes the 
processor's hardware interface. First, the set of signals 
available at the processor pins is described (see Signal 
Description on page 63). Following that are the signal 
waveforms occurring during bus cycles (see Bus Trans- 
fer Mechanism on page 58, Bus Functional Description 
on page 61, and Other Functional Descriptions on 
page 78). 


Signal Description 


Introduction 


Ahead is a brief description of the AM386DE CPU input 
and output signals arranged by functional groups (see 
Figure 30). 


Example signal: 


M/IO —High voltage indicates Memory selected 
—tLow voltage indicates I/O selected 


The signal descriptions sometimes refer to AC timing 
parameters, such as t25 RESET Setup Time and t26 
RESET Hold Time. 


Clock (CLK2) 


CLK2 provides the fundamental timing for the 
Am386DE microprocessor. It is divided by 2 internally to 
generate the intemal processor clock used for instruc- 
tion execution. The internal clock is comprised of two 
phases, phase one and phase two. Each CLK2 period is 
a phase of the internal clock. Figure 31 illustrates the 
relationship. If desired, the phase of the internal proces- 
sor clock can be synchronized to a known phase by 
ensuring the RESET signal falling edge meets its appli- 
cable setup and hold times, t25 and t26. 


Data Bus (D31-D0) 


These three-state, bidirectional signals provide the gen- 
eral purpose data path between the Am386DE micro- 
processor and other devices. Data bus inputs and 
outputs indicate 1 when High. The data bus can transfer 
data on 32- and 16-bit buses using a data bus sizing fea- 
ture controlled by the BS16 input. See the Bus Control 
section on page 54. Data bus reads require that read 
data setup and hold times, t21 and t22, be met for cor- 
rect operation. In addition, the Am386DE microproces- 
sor requires that all data bus pins be at a valid togic state 
(High or Low) at the end of each read cycle, when 
READY is asserted. During any write operation (and 
during halt cycles and shut down cycles), the AM386DE 
microprocessor always drives all 32 signals of the data 
bus even if the current bus size is 16 bits. 


rr 
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Figure 31. CLK2 Signal and Internal Processor Clock 
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Bus Cycle Definition Signals (W/R, D/C, M/TO, 


These three-state outputs provide physical memory 
addresses or i/O port addresses. The address bus is 
capable of addressing 4 Gbytes of physical memory 
space (QQO00000H-FFFFFFFFH), and 64 Kbytes of I/O 
address space (00000000H—O000FFFFH) for pro- 
grammed I/O. I/O transfers automatically generated for 
Am386DE _ microprocessor-to-coprocessor commu- 
nication use I/O addresses 800000F8H-800000FFH, 
so A311 is High in conjunction with M/O Low allows sim- 
ple generation of the coprocessor select signal. 


The Byte Enable outputs, BE3—-BE0, directly indicate 
which bytes of the 32-bit data bus are involved with the 
current transfer. This is most convenient for external 
hardware. 


BEO applies to D7-DO 


BE1 applies to D15-D8 
BE2 applies to D23-D16 
BES applies to D31-D24 


The number of Byte Enables asserted indicates the 
physical size of the operand being transferred (1, 2,3, or 
4 bytes). Refer to the Operand Alignment section on 
page 61. 


When a memory write cycle or I/O write cycle is in prog- 
ress and the operand being transferred occupies only 
the upper 16 bits of the data bus (D31-D16), duplicate 
data is simultaneously presented on the corresponding 
lower 16 bits of the data bus (D15—D0). This duplication 
is performed for optimum write performance on 16 bit 
buses. The pattern of write data duplication is a function 
of the Byte Enables asserted during the write cycle. 
Table 15 lists the write data present on D31—D0, as a 
function of the asserted Byte Enable outputs BE3—BE0. 


Tock) 


These three-state outputs define the type of bus cycle 
being performed. W/R distinguishes between write and 
read cycles, D/C between data and control cycles, M/iO 
between memory and I/O cycles, and LOCK between 
locked and unlocked bus cycles. 


The primary bus cycle definition signals are W/R, D/C, 
and M/IO, since these are the signals driven valid as the 
ADS (Address Status output) is driven asserted. The 
LOCK is driven valid at the same time as the first locked 
bus cycle begins, which due to address pipelining, could 
be later than ADS is driven asserted. See Pipelined 
Address. The LOCK is negated when the READY input 
terminates the last bus cycle that was locked. 


Exact bus cycle definitions, as a function of W/R, D/C, 
and M/O, are given in Table 16. Note one combination 
of W/R, D/C, and MAO is never given when ADS is 
asserted (however, that combination, which is listed as 
does not occur, may occur during idle bus states when 
ADS is not asserted). If M/1O, D/C, and W/R are qualitied 
by ADS asserted, then a decoding scheme may be sim- 
plified by using this definition. 


Bus Control Signals (ADS, READY, NA, BS16) 
Introduction 


The following signals allow the processor to indicate 
when bus cycle has begun and allow other system hard- 
ware to control address pipelining, data bus width, and 
bus cycle termination. 


Address Status (ADS) 
This three-state output indicates that a valid bus cycle 


definition and address (W/R, D/C, M/IO, BE3-BEO, and 
A31-A2) is being driven at the Am386DE 


Table 15. Write Data Duplication as a Function of BE3-BEO 


Am386DE CPU Byte Enables Am386DE CPU Write Data 
BES BE2 BET BEO D31-D24 


Automatic 


D23-D16 Duplication? 


Undef 
Undef 
Cc 
Undef 


D15-D8 D7-D0 


Undef 
Cc 
c 


Key: D = Logical Write Data D31-D24 B= Logical Write Data D15—-D8 
C Logical Write Data D23-D16 A = Logical Write Data D7-Do 
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Table 16. Bus Cycie Definition 


Low Does Not Occur 
[tow | High | tow [V0 Data Read 
[tow [Rah gh 10 ata Wt 7 


Shutdown: 
Address = 0 


BEO Low 


BET High 
BE2 High 
BES High 


High Low High Halt: 
Address = 2 
BEO High 
BET High 
BE2 Low 
BES High 
A31-A2 Low 


A31-A2 Low 


Merry Data Read Some Gyces 


microprocessor pins. It is asserted during T1 and T2P 
bus states (see Non-Pipelined Address on page 66 and 
Pipelined Address on page 70 for additional information 
on bus states). 


Transfer Acknowledge (READY) 


This input indicates the current bus cycle is complete, 
and the active bytes indicated by BE3-BE0 and BS16 
are accepted or provided. When READY is sampled 
asserted during a read cycle or interrupt acknowledge 
cycle, the AmM386DE microprocessor latches the input 
data and terminates the cycle. When READY is sampled 
asserted during a write cycle, the processor terminates 
the bus cycle. 


READY is ignored on the first bus state of all bus cycles, 
and sampled each bus state thereafter until asserted. 
READY must eventually be asserted to acknowledge 
every bus cycle, including Hatt Indication and Shutdown 
Indication bus cycles. When being sampled, READY 
must always meet setup and hold times, t19 and t20, for 
correct operation. See all sections of Bus Functional 
Description starting on page 61. 


Next Address Request (NA) 


This is used to request address pipelining. This input 
indicates the systemis prepared to accept new values of 
BE3-BE0, A31-A2, W/R, D/C, and M/IO from the 
Am386DE microprocessor even if the end of the current 
cycle is not being acknowledged on READY. If this input 
is asserted when sampled, the next address is driven 
onto the bus provided the next bus request is already 
pending intemally. See Address Pipelining on page 63 
and Read and Write Cycles on page 65. NA must always 
meet setup and hold times, t15 and t16, for correct 
operation. 


Bus Size 16(BS16) 


The BS16 feature allows the AM386DE microprocessor 
to directly connect to 32- and 16-bit data buses. Assert- 
ing this input constrains the current bus cycle to use only 
the lower-order half (D15—D0) of the data bus, corre- 
sponding to BEO and BET. Asserting BS16 has no addi- 
tional eftect if only BEO and/or BE1 are asserted in the 
current cycle. However, during bus cycles asserting 
BE2 or BES, asserting BS16 will automatically cause the 
Am386DE microprocessor to make adjustments for cor- 
rect transfer of the upper byte(s) using only physical 
data signals D15—D0. 


If the operand spans both halves of the data bus and 
BS16 is asserted, the Am386DE microprocessor will 
automatically perform another 16-bit bus cycle. BS16 
must always meet setup and hold times, t17 andt18, for 
correct operation. 


Am386DE CPU I/O cycles are automatically generated 
for coprocessor communication. Since the Am386DE 
microprocessor must transfer 32-bit quantities between 
itself and a 387DX math coprocessor, BS16 must not be 
asserted during 387DX math coprocessor communica- 
tion cycles. 


Bus Arbitration Signals (HOLD, HLDA) 
introduction 


This section describes the mechanism by which the pro- 
cessor relinquishes control of its local buses when 
requested by another bus master device. See Entering 
and Exiting Hold Acknowledge on page 78 for additional 
information. 


Bus Hold Request (HOLD) 


This input indicates some device other than the 
Am386DE CPU requires bus mastership. 
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HOLD must remain asserted as long as any other device 


is a local bus master. HOLD is not recognized while 
RESET is asserted. If RESET is asserted while HOLD is 
asserted, RESET has priority and places the bus into an 
idle state, rather than the hold acknowledge (high 
impedance) state. HOLD is level-sensitive and is a syn- 
chronous input. HOLD signals must always meet setup 
and hold times, t23 and t24, for correct operation. 


Bus Hold Acknowledge (HLDA) 


Assertion of this output indicates the Am386DE micro- 
processor has relinquished control of its local bus in 
response to HOLD asserted, and is in the Bus Hold 
Acknowledge state. 


The Hold Acknowledge state offers near-complete sig- 
nal isolation. In the Hold Acknowledge state, HLDA is 
the only signal being driven by the Am386DE micropro- 
cessor. The other output signals or bidirectional signals 
(D31-D0, BE3-BE0, A31~—A2, W/R, D/C, M0, LOCK, 
and ADS) are in a high-impedance state so the request- 
ing bus master may control them. Puil-up resistors may 
be desired on several signals to avoid spurious activity 
when no bus master is driving them. See Resistor Rec- 
ommendations. Also, one rising edge occurring on the 
NMI input during Hold Acknowledge is remembered for 
processing after the HOLD input is negated. 


In addition to the normal usage of Hold Acknowledge 
with DMA controllers or master peripherals, the near- 
complete isolation has particular attractiveness during 
system test when test equipment drives the system and 
in hardware-fault-tolerant applications. 


Coprocessor Interface Signals (PEREQ, BUSY, 
ERROR) 


Introduction 


Inthe following sections are descriptions of signals dedi- 
cated to the coprocessor interface. In addition to the 
data bus, address bus, and bus cycle definition signals, 
these following signals control communication between 
the Am386DE microprocessor and its 387DX math 
coprocessor extension. 


Coprocessor Request (PEREQ) 


When asserted, this input signal indicates a coproces- 
sor request for a data operand to be transferred to/from 
memory by the AM386DE microprocessor. In response, 
the Am386DE CPU transfers information between the 
coprocessor and memory. Because Am386DE micro- 
processor has internally stored the coprocessor op- 
code being executed, it performs the requested data 
transfer with the correct direction and memory address. 


PEREQ is level-sensitive and is allowed to be asynchro- 
nous to the CLK2 signal. 


Coprocessor Busy (BUSY) 


When asserted, this input indicates the coprocessor is 
still executing an instruction and is not yet able to accept 


another. When the Am386DE microprocessor encoun- 
ters any coprocessor instruction that operates on the 
numeric stack (e.g., load, pop, or arithmetic operation) 
or the WAIT instruction, this input is first automatically 
sampled until it is seen to be negated. This sampling of 
the BUSY input prevents overrunning the execution of a 
previous coprocessor instruction. 


The FNINIT and FNCLEX coprocessor instructions are 
allowed to execute even if BUSY is asserted, since 
these instructions are used for coprocessor initialization 
and exception-clearing. 


BUSY is level-sensitive and is allowed to be asynchro- 
nous to the CLK2 signal. 


BUSY serves an additional function. If BUSY is sampled 
Low at the falling edge of RESET, the AmM386DE micro- 
processor performs an internal self-test (see the Bus 
Activity During and Following Reset section on page 
84). If BUSY is sampled High, no self-test is performed. 


Coprocessor Error (ERROR) 


This input signal indicates that the previous coprocessor 
instruction generated a coprocessor error of a type not 
masked by the coprocessor's control register. This input 
is automatically sampled by the Am386DE micropro- 
cessor when a coprocessor instruction is encountered, 
and if asserted, the Am386DE device generates Excep- 
tion 16 to access the error-handling software. 


Several coprocessor instructions, generally those that 
clear the numeric error flags in the coprocessor or save 
coprocessor state, do execute without the Am386DE 
microprocessor generating Exception 16 even if 
ERROR is asserted. These instructions are FNINIT, 
FNCLEX, FSTSW, FSTSWAX, FSTCW, FSTENV, 
FSAVE, FESTENV, and FESAVE. 


ERROR is level-sensitive andis allowed to be asynchro- 
nous to the CLK2 signal. 


Interrupt Signals (INTR, NMI, RESET) 
introduction 


The following descriptions cover inputs that can inter- 
rupt or suspend execution of the processor's current 
instruction stream. 


Maskable interrupt Request (INTR) 


When asserted, this input indicates a request for inter- 
rupt service, which can be masked by the Am386DE 
CPU Fiag Register IF bit. When the Am386DE micro- 
processor responds to the INTR input, it performs two 
interrupt acknowledge bus cycles, and at the end of the 
second, latches an 8-bit interrupt vector on D17-D0 to 
identify the source of the interrupt. 


INTR is level-sensitive and is allowed to be asynchro- 
nous to the CLK2 signal. To assure recognition of an 
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INTR request, INTR should remain asserted until the 
first interrupt acknowledge bus cycle begins. 


Non-Maskable Interrupt Request (NMI) 


This input indicates a request for interrupt service, which 
cannot be masked by software. The non-maskable 
interrupt request is always processed according to the 
pointer or gate in slot 2 of the interrupt table. Because of 
the fixed NMI slot assignment, no interrupt acknowledge 
cycles are performed when processing NMI. 


NM lis rising edge-sensitive and is allowed to be asynch- 
ronous to the CLK2 signal. To assure recognition of NMI, 
it must be negated for at least eight CLK2 periods, and 
then be asserted for at least eight CLK2 periods. 


Once NMI processing has begun, no additional NMI’s 
are processed until after the next IRET instruction, 
which is typically the end of the NMI service routine. If 
NMlis re-asserted prior to that time, however, one rising 
edge on NMI will be remembered for processing after 
executing the next IRET instruction. 


Reset (RESET) 


This input signal suspends any operation in progress 
and places the Am386DE microprocessor in a known 
reset state. The AmM386DE device is reset by asserting 
RESET for 15 or more CLK2 periods (80 or more CLK2 
periods before requesting self-test). When RESET is 
asserted, all other input pins, except FLT, are ignored, 
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and all other bus pins are driven to an idle bus state as 
shown in Table 17. If RESET and HOLD are both 
asserted at a point in time, RESET takes priority even if 
the AmM386DE device was in a Hold Acknowledge state 
prior to RESET asserted. 


Table 17. Pin State (Idle Bus) During Reset 


High 
High Impedance 
Low 


High 


Low 
High 
Low 
High 
Low 


RESET is level-sensitive and must be synchronous to 
the CLK2 signal. If desired, the phase of the internal pro- 
cessor clock and the entire Am386DE CPU state can be 
completely synchronized to external circuitry by ensur- 
ing the RESET signal falling edge meets its applicable 
setup and hold times, t25 and t26. The signal summary 
is shown in Table 18. 


Table 18. Am386DE Microprocessor Signal Summary 


Clock 

Data Bus 

Byte Enables 

Address Bus 

Write-Read Indication 
Data-Control Indication 
Memory-!/O Indication 

Bus Lock Indication 
Address Status 

Next Address Request 

Bus Size 16 

Transfer Acknowledge 

Bus Hold Request 

Bus Hold Acknowledge 
Coprocessor Request 
Coprocessor Busy 
Copracessor Error 
Maskable Interrupt Request 
Non-Maskable Intrpt Request 
Reset 


Input Synch Output High 
or Asynch to Impedance During 
CLK2 HLDA? 


------o-+---o00000006 
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Bus Transfer Mechanism 


Introduction 


All data transfers occur as a result of one or more bus 
cycles. Logical data operands of byte, word, and Dword 
lengths may be transferred without restrictions on 
physical address alignment. Any byte boundary may be 
used, although two or eventhree physical bus cycles are 
performed as required for unaligned operand transfers. 
See Dynamic Data Bus Sizing on page 59 and Operand 
Alignment on page 61. 


The Am386DE microprocessor address signals are 
designed to simplify extemal system hardware. Higher- 
order address bits are provided by A31—A2. Lower-or- 
der address in the form of BE3-BE0 directly provides 
linear selects for the four bytes of the 32-bit data bus. 
Physical operand size information is thereby implicitly 
provided for each bus cycle in the most usable form. 


Byte Enable outputs, BE3-BEO, are asserted when their 
associated data bus bytes are involved with the present 
bus cycle, as listed in Table 19. During a bus cycle, any 
possible pattern of contiguous asserted Byte Enable 
outputs can occur, but never patterns having a negated 
Byte Enable separating two or three asserted 
Enables. 


Table 19. Byte Enables and Associated 
Data and Operand Bytes 


Byte Enable Signal| Associated Data Bus Signals 


D7-Do (Byte 0—least significant) 
D15~—D8 (Byte 1) 

D23-D16 (Byte 2) 

D31-D24 (Byte 3—most significant) 
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Address bits AO and Ai of the physical operand’s base 
address can be created when necessary (for instance, 
for MULTIBUS | or MULTIBUS II interface), as a function 
of the lowest-order asserted Byte Enable. This is shown 
by Table 20. Logic to generate AO and A1 is given by 
Figure 32. 


Each bus cycle is composed of at least two bus states 
and each bus state requires one processor clock period. 
Additional bus states added to a single bus cycle are 
called wait states. See Bus Functional Description 
section starting on page 61. 


Since a bus cycle requires a minimum of two bus states 
(equal to two processor clock periods), data can be 
transferred between external devices and _ the 
Am386DE CPU at a maximum rate of one 4-byte Dword 
every two processor clock periods, for a maximum bus 
bandwidth of 66 Mbytes/sec (AMS386DE microproces- 
sor operating at 33-MHz processor clock rate). 


Memory and I/O Spaces 


Bus cycles may access physical memory space or I/O 
space. Peripheral devices in the system may be either 
memory-mapped, or !/O-mapped, or both. As shown in 
Figure 33, physical memory addresses range from 
00000000H to FFFFFFFFH (4 Gbytes) and I/O 
addresses from 00000000H to OOOOFFFFH (64 Kbytes) 
for programmed I/O. Note the I/O addresses used by the 
automatic /O cycles for coprocessor communication 
are 800000F8H to 800000FFH, beyond the address 
range of programmed I/O, to allow easy generation of a 
coprocessor chip select signal using the A31 and M/iO 
signals. 


Memory and I/O Organization 


The Am386DE microprocessor datapath to memory 
and I/O spaces can be 32- or 16-bits wide. When 32-bits 
wide, memory and I/O spaces are organized naturally 
as arrays of physical 32-bit Dwords. Each memory or I/O 
Dword has four individually addressable bytes at 


Table 20. Generating A31-A0 from BE3—BE0 and A31~—A2 


Se CPU Address Signals 


[ee Base 
Address 


mee 4 


> Low [i | 
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K ~ Map for AO Signal 


a (8 a] |B 


amo if 


AO 


18421A~-035 


Figure 32. Logic to Generate AO, A1 from BE3—-BEO 


consecutive byte addresses. The lowest-addressed 
byte is associated with data signals D17—D0; the high- 
est-addressed byte with D31—-D24. 


The Am386DE microprocessor includes a bus control 
input, BS16, that also allows direct connection to 16-bit 
memory or I/O spaces organized as a sequence of 
16-bit word. Cycles to 32- and 16-bit memory or I/O 
devices may occur in any sequence, since the BS16 
control is sampled during each bus cycle. (See Dynamic 
Data Bus Sizing.) The Byte Enable signals, BE3-BEO, 
allow byte granularity when addressing any memory or 
/O structure, whether 32- or 16-bits wide. 


Dynamic Data Bus Sizing 

Dynamic Data Bus Sizing is a feature allowing direct 
processor connection to 32- or 16-bit data buses for 
memory or I/O. A single processor may connect to both 
size buses. Transfers to or from 32- or 16-bit ports are 
supported by dynamically determining the bus width 
during each bus cycle. During each bus cycle an 
address decoding circuit or the slave device itself may 
assert BS16 for 16-bit ports, or negate BS16 for 32-bit 
ports. 


With BS16 asserted, the processor automatically con- 
verts operand transfers larger than 16 bits, or mis- 
aligned 16-bit transfers, into two or three transfers as 
required. All operand transfers physically occur on 
D15-D0 when BS16 is asserted. Therefore, 16-bit 
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memories or I/O devices only connect on data signals 
D15~D0. No extra transceivers are required. 


Asserting BS16 only affects the processor when BE2 
and/or BES are asserted during the current cycle. If only 
D15—-D0 are involved with the transfer, asserting BS16 
has no effect since the transfer can proceed normally 
over a 16-bit bus whether BS16 is asserted or not. In 
other words, asserting BS16 has no effect when only the 
lower half of the bus is involved with the current cycle. 


There are two types of situations where the processor is 
affected by asserting BS16, depending on which Byte 
Enables are asserted during the current bus cycle. 


Upper Half Only: 
Only BE2 and/or BE3 asserted. 
Upper and Lower Hatt: 


At least BE1, BE2 asserted (and perhaps also 
BEO and/or BE3). 


Eftect of asserting BS16 during Upper Half Only read 
cycles: 


Asserting BS76 during Upper Half Only reads causes the 
Am386DE microprocessor to read data on the lower 16 
bits of the data bus and ignore data on the upper 16 bits of 
the data bus. Data that would have been read trom 
D31-D16 (as indicated by BE2 and BE3) will instead be 
read from D15—D0, respectively. 
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FFFFFFFFH Ys 
Not 
Accessible 
Physical 
Memory pasa alt _ Math Coprocessor 
4 Gbytes (See note} (387DX) 
Accessible 
OOOOFFFFH 
Accessible 
64 Kbytes Programmed 
00000000H 00000000H VO Space 


Physical Memory Space 


Note: 


VO Space 


Since A31 is High during automatic communication with coprocessor, A31 High and MAO Low can be used to 


easily generate a coprocessor select signal. 


Figure 33. Physical Memory and /O Spaces 
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Effect of asserting BS16 during Upper Half Only write 
cycles: 


Asserting BS16 during Upper Half Only writes does not 
affect the Am386DE microprocessor. When only BE2 
and/or BES are asserted during a Write cycle, the 
Am386DE microprocessor always duplicates data signals 
D31-016 onto D15—D0 (see Table 15). Therefore, no fur- 
ther Am386DE CPU action is required to perform these 
writas on 32- or 16-bit buses. 


Effect of asserting BS16 during Upper and Lower Half 
read cycles: 


Asserting BS16 during Upper and Lower Half reads 
causes the processor to perform two 16-bit read cycles for 
complete physical operand transfer. Bytes 0 and 1 (as indi- 
cated by BEO and BE?) are read on the first cycle using 
015-Do. Bytes 2 and 3 (as indicated by BE2 and BES) are 
read during the second cycle, again using D15-Do. 
D31-D16 are ignored during both 16-bit cycles. BEO and 
BET are always negated during the second 16-bit cycle. 
See Figure 43 Cycles 2 and 2A. 


Effect of asserting BS16 during Upper and Lower Halt 
write cycles: 


Asserting BS16 during Upper and Lower Half writes 
causes the Am386DE microprocessor to perform two 


16-bit write cycles for complete physical operand transfer. 
All bytes are available the first write cycle allowing external 
hardware to receive Bytes 0 and 1 (as indicated by BEO 
and BET) using D15—-D0. On the second cycle the 
Am386DE microprocessor duplicates Bytes 2 and 3 on 
D15~D0 and Bytes 2 and 3 (as indicated by BE2 and BES) 
are written using 015-D0. BEO and BET are always 
negated during the second 16-bit cycle. BST6 must be 
asserted during the second 16-bit cycle. See Figure 43 
Cycles 1 and 1A. 


Interfacing with 32- and 16-Bit Memories 


In 32-bit-wide physical memories such as Figure 34, 
each physical Dword begins at a byte address that is a 
multiple of 4. A31-A2 are directly used as a Dword 
selects and BE3—BE0 as byte selects. BS16 is negated 
for all bus cycles involving the 32-bit array. 


When 16-bit-wide physical arrays are included in the 
system, as in Figure 35, each 16-bit physical word 
begins at an address that is a multiple of 2. Note the 
address is decoded to assert BS16 only during bus 
cycles involving the 16-bit array. If desiring to use pipe- 
lined address with 16-bit memories, then BE3-BEO and 
W/R are also decoded to determine when BS16 should 
be asserted. (See Pipelined Address with Dynamic Data 
Bus Sizing on page 77.) 
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Figure 34. Am386DE Microprocessor with 32-Bit Memory 
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Figure 35. Am386DE Microprocessor with 32-Bit and 16-Bit Memory 


A31-A2 are directly usable for addressing 32- and 
16-bit devices. To address 16-bit devices, A1 and two 
Byte Enable signals are also needed. 


To generate an A1 signal and two Byte Enable signals 
for 16-bit access, BE3-BEO should be decoded as in 
Table 21. Note that certain combinations of BE3~BEO 
are never generated by the Am386DE microprocessor, 
leading to “don’t care” conditions in the decoder. Any 
BE3—BEO decoder, such as shown in Figure 36, may 
use the non-occurring BE3-BEO combinations to its 
best advantage. 


Operand Alignment 


With the flexibility of memory addressing on the 
Am386DE microprocessor, it is possible to transfer a 
logical operand that spans more than one physical 
Dword or Word of memory or I/O. Examples are 32-bit 
Dword operands beginning at addresses not evenly 


divisible by a 4- or a 16-bit Word operand split between 
two physical Dwords of memory array. 


Operand alignment and data bus size dictates when 
multiple bus cycles are required. Table 22 describes the 
transfer cycles generated for all combinations of logical 
operand lengths, alignment, and data bus sizing. When 
multiple bus cycles are required to transfer a multi-byte 
logical operand, the highest-order bytes are transferred 
first (but if BS16 asserted requires two 16-bit cycles be 
performed, that part of the transfer is lowest-order first). 


Bus Functional Description 


Introduction 


The Am386DE microprocessor has separate, parallel 
buses for data and address. The data bus is 32 bits in 
width and is bidirectional. The address bus provides a 
32-bit value using 30 signals for the 30 upper-order 
address bits and 4 Byte Enable signals to directly 
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Table 21. Generating A1, BHE, and BLE for Addressing 16-Bit Devices 


Am386DE CPU Signals 
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BLE asserted when D7—D0 of 16-bit bus is active. 
BHE asserted when D15—D8 of 16-bit bus is active. 
A! Low for all even words; A1 High for all odd words. 


X = “Don't Care” 
H = High voltage level 
L «= Low voltage level 


* 


Key: 
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Comments 


X—no active bytes 


X—non-contiguous bytes 


X—non-contiguous bytes 
X—non-contiguous bytes 
X—non-contiguous bytes 


X—non-contiguous bytes 


~roKO- uM KKM TOG KOO eK 


= Anon-occurring pattern of Byte Enables; either none are asserted 


or the pattern has Byte Enables asserted for non-contiguous bytes. 


indicate the active bytes. These buses are interpreted 
and controlled via several associated definition or con- 
trol signals. 


The definition of each bus cycle is given by three defini- 
tion signals: M/IO, W/R, and D/C. At the same time, a 
valid address is present on the Byte Enable signals 
BE3-BEO and other address signals, A31—A2. A status 
signal, ADS, indicates when the Am386DE CPU issues 
a new bus cycle definition and address. 


Collectively, the address bus, data bus, and all 
associated control signals are referred to simply as the 
bus. 


When active, the bus performs one of the bus cycles 
below. 


Read from memory space. 

Locked read from memory space. 
Write to memory space. 

. Locked write to memory space. 

Read from 1/0 space (or coprocessor). 
. Write to I/O space (or coprocessor). 

. Interrupt acknowledge. 

. Indicate halt or indicate shutdown. 


Table 16 shows the encoding of the bus cycle definition 
signals for each bus cycle. See the Bus Cycle Definition 
Signals section on page 54. 


The data bus has a dynamic sizing feature supporting 
32- and 16-bit bus size. Data bus size is indicated to the 
Am386DE microprocessor using its Bus Size 16 (BS16) 
input. All bus functions can be performed with either 
data bus size. 


When the Am386DE CPU bus is not performing one of 
the activities listed above, it is either Idle or in the Hold 
Acknowledge state, which may be detected by external 
circuitry. The Idle state can be identified by the 
Am386DE microprocessor giving no further assertions 
on its address strobe output (ADS) since the beginning 
of its most recent bus cycle, and the most recent bus 
cycle has been terminated. The Hold Acknowledge 
State is identified by the Am386DE CPU asserting its 
Hold Acknowledge (HLDA) output. 


The shortest time unit of bus activity is a bus state. Abus 
state is one processor clock period (two CLK2 periods) 
in duration. A complete data transfer occurs during a 
bus cycle, composed of two or more bus states. 


62 Am386DE Microprocessor 


PRELIMINARY 


BE1 


K — Map for 16-bit BLE signal (same as AO signal in Figure 32). 
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18421 A-039 


Figure 36. Logic to Generate A1, BHE, and BLE for 16-Bit Buses 


The fastest Am386DE microprocessor bus cycle 
requires only two bus states. For example, three con- 
secutive bus read cycles, each consisting of two bus 
states, are shown by Figure 37. The bus states in each 
cycle are named T1 and T2. Any memory or I/O address 
may be accessed by such a two-state bus cycle, if the 
external hardware is fast enough. The high-bandwidth, 
two-clock bus cycle realizes the full potential of fast main 
memory, or cache memory. 


Every bus cycle continues until it is acknowledged by the 
external system hardware, using the Am386DE micro- 
processor READY input. Acknowledging the bus cycle 
at the end of the first T2 results in the shortest bus cycle, 
requiring only T1 and T2. If READY is not immediately 
asserted, however, T2 states are repeated indefinitely 
until the READY input is sampled asserted. 


Address Pipelining 


The address pipelining option provides a choice of bus 
cycle timings. Pipelined or non-pipelined address timing 
is selectable on a cycle-by-cycle basis with the Next 
Address (NA) input. 


When address pipelining is not selected, the current 
address and bus cycle definition remain stable through- 
out the bus cycle. 


When address pipelining is selected, the address 
(BE3-BE0, A31—-A2) and definition (W/R, D/C, and 
M/1O) of the next cycie are available before the end of 
the current cycle. To signal their availability, the 
Am386DE microprocessor address status output (ADS) 
is also asserted. Figure 38 illustrates the fastest read 
cycles with pipelined address timing. 
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Table 22. Transfer Bus Cycles for Bytes, Words, and Dwords 


Physical Byte Address in Memory (low-order bits) 


Transfer Cycles over 32-bit Data Bus 


Transfer Cycles over 16-bit Data Bus 


Key: 

b = Byte transfer 

W = Word transfer 

| = low-order portion 
m = mid-order portion 


3 = 3-byte transfer 

d = Dword transfer 

h = high-order portion 
x = Don’ care 


Byte-Length of Logical Operand 


"For this case, 8086, 8088, 80186, 80188, 80286 transfer Ib first, then hb. 


Cycle 1 
Non-Pipelined 
(Read) 


CLK2 (input) 


M/A, BE3-BEo, 
A31-A2, D/C, W/R 
(Outputs) 


ADS (Output) 


NA 
(Input) 


(Input) 


LOCK 
(Output) 


[sme ee omen: Re es ee eee Ce ee | 


031—Do [ 
(Input during Read) 


Note: 
Fastest non-pipelined bus cycles consist of T1 and T2 


Figure 37. Fastest Read Cycles with Non-Pipelined Address Timing 


Note from Figure 38, the fastest bus cycles using pipe- 
lined address require only two bus states, named T1P 
and T2P. Therefore, cycles with pipelined address tim- 
ing allow the same data bandwidth as non-pipelined 
cycles, but address-to-data access time is increased 
compared to that of a non-pipelined cycle. 


By increasing the address-to-data access time, pipe- 
lined address timing reduces wait state requirements. 


Cycle 2 Cycle 3 
Non-Pipelined Non-Pipelined 
(Read) (Read) 


Valid 3 


= 
ae 


| Valid 3 
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For example, if one wait state is required with non-pipe- 
lined address timing, no wait states would be required 
with pipelined address. 


Pipelined address timing is useful in typical systems 
having address latches. In those systems, once an 
address has been latched, pipelined availability of the 
next address allows decoding circuitry to generate chip 
selects (and other necessary select signals) in advance, 
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(Input) 


LOCK 
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Note: 
Fastest pipelined bus cycles consist of T1P and T2P. 


Figure 38. Fastest Read Cycles with Pipelined Address Timing 


so selected devices are accessed immediately when 
the next cycle begins. In other words, the decode time 
forthe next cycle can be overlapped with the end of the 
current cycle. 


If a system contains a memory structure of two or more 
interleaved memory banks, pipelined address timing 
potentially allows even more overlap of activity. This is 
true when the interleaved memory controller is designed 
to allow the next memory operation to begin in one 
memory bank while the current bus cycle is still activat- 
ing another memory bank. Figure 39 shows the general 
structure of the Am386DE microprocessor with two- 
bank and four-bank interleaved memory. Note each 
memory bank of the interleaved memory has full data 
bus width (32-bit data width typically, unless 16-bit bus 
size is selected). 


Further details of pipelined address timing are given in 
Pipetined Address on page 70; tnitiating and Maintain- 
ing Pipelined Address on page 70; and, Pipelined 
Address with Dynamic Data Bus Sizing on page 77. 


Read and Write Cycles 


introduction 


Data transfers occur as a result of bus cycles, classified 
as Read or Write cycles. During Read cycles, data is 


TAP 
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transferred from an external device to the processor. 
During Write cycles, data is transferred in the other 
direction, from the processor to an external device. 


Two choices of address timing are dynamically select- 
able: non-pipelined or pipelined. After a bus idle state, 
the processor always uses non-pipelined address tim- 
ing. However, the NA (Next Address) input may be 
asserted to select pipelined address timing for the next 
bus cycle. When pipelining is selected and the 
Am386DE microprocessor has a bus request pending 
internally, the address and detinition ot the next cycle is 
made available even betore the current bus cycle is 
acknowledged by READY. Generally, the NA input is 
sampled each bus cycle to select the desired address 
timing for the next bus cycle. 


Two choices of physical data bus width are dynamically 
selectable: 32 bits or 16 bits. Generaily, the BS16 (Bus 
Size 16) input is sampled near the end of the bus cycle to 
contirm the physical data bus size applicable to the cur- 
rent cycle. Negation of BS16 indicates a 32-bit size and 
assertion indicates a 16-bit bus size. 


If 16-bit bus size is indicated, the AM386DE CPU auto- 
matically responds as required to complete the transfer 
on a 16-bit data bus. Depending on the size and 
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Two-Bank Interleaved Memory: 
a. Address signal A2 selects bank 
b. 32-bit datapath to each bank 
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Four-Bank Interlaaved Memory: 
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b. 32-bit datapath to each bank 
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Figure 39. Two-Bank and Four-Bank Interleaved Memory Structure 
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alignment of the operand, another 16-bit bus cycle may 
be required. Table 21 provides all details. When neces- 
sary, the AM386DE microprocessor performs an addi- 
tional 16-bit bus cycle, using D15—D0 in place of 
D31-D16. 


Terminating a Read cycle or Write cycle, like any bus 
cycle, requires acknowledging the cycle by asserting 
the READY input. Until acknowledged, the processor 
inserts wait states into the bus cycle to allow adjustment 
for the speed of any external device. External hardware 
that has decoded the address and bus cycle type 
asserts the READY input at the appropriate time. 


At the end of the second bus state within the bus cycle, 
READY is sampled. At that time, if external hardware 
acknowledges the bus cycle by asserting READY, the 
bus cycle terminates as shown in Figure 40. If READY is 
negated as in Figure 41, the cycle continues another 
bus state (a wait state) and READY is sampled again at 


the end of that state. This continues indefinitely until the 
cycle is acknowledged by READY asserted. 


When the current cycle is acknowledged, the AM386DE 
microprocessor terminates it. When a Read cycle is 
acknowledged, the Am386DE CPU latches the informa- 
tion present at its data pins. When a Write cycle is 
acknowledged, the Am386DE CPU write data remains 
valid throughout phase one of the next bus state to pro- 
vide write data hold time. 


Non-Pipelined Address 


Any bus cycle may be performed with non-pipelined 
address timing. For example, Figure 40 shows a mix- 
ture of Read and Write cycles with non-pipelined 
address timing. Figure 40 shows that the fastest pos- 
sible cycles with non-pipelined address have two bus 
states per bus cycle. The states are named T1 and T2. in 
phase one of the T1, the address signals and bus cycle 
definition signals are driven valid, and to signal their 
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Idle states are shown here for diagram variety only. Write cycles are not always followed by an idle state. 


An active bus cycle can immediately follow the write cycle. 
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Figure 40. Various Bus Cycles and Idle States with Non-Pipelined Address (Zero Walt States) 


availability, address status (ADS) is simultaneously 
asserted. 


During Read or Write cycles, the data bus behaves as 
follows. If the cycle is a read, the Am386DE micropro- 
cessor floats its data signals to allow driving by the exter- 
nal device being addressed. The Am386DE device 
requires that all data bus pins be at a valid logic state 
(High or Low) at the end of each read cycle, when 
READY is asserted, even if all byte enables are not 
asserted. The system must be designed to meet this 


requirement. If the cycle is a write, data signals are 
driven by the Am386DE device beginning in phase two 
of T1 until phase one of the bus state following cycle 
acknowledgment. 


Figure 41 illustrates non-pipelined bus cycles with one 
wait added to Cycles 2 and 3. READY is sampled 
negated at the end of the first T2 in Cycles 2 and 3. 
Therefore, Cycles 2 and 3 have T2 repeated. At the end 
of the second T2, READY is sampled asserted. 
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Idia states are shown here for diagram variety only. Write cycles are not always followed by an idle state. 


An active bus cycle can immediately fallow the write cycle. 


Figure 41. Various Bus Cycles and Idie States with Non-Pipelined Address 
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(Varlous Number of Walt States) 


When address pipelining is not used, the address and 
bus cycle definition remain valid during all wait states. 
When wait states are added and you desire to maintain 
non-pipelined address timing, it is necessary to negate 
NA during eachT2 state except the last one, as shown in 
Figure 41 Cycles 2 and 3. It NAis sampled asserted dur- 
ing a T2 other than the last one, the next state would be 
T2l (for pipelined address) or T2P (for pipelined 
address) instead of another T2 (for non-pipelined 
address). 


Figure 42 illustrates the bus states and transitions when 
address pipelining is not used. The bus transitions 
between four possible states: T1, T2, Ti, and Th. Bus 
cycles consist of T1 and T2, with T2 being repeated for 
wait states. Otherwise, the bus may be idle in the Ti 
state, or in hold acknowledge, the Th state. 


When address pipelining is not used, the bus state dia- 
gram is as shown in Figure 42. When the bus is idle, itis 
in state Ti. Bus cycles always begin with T1. T1 always 
leads to T2. If a bus cycle is not acknowledged during T2 
and NA is negated, T2 is repeated. When a cycle is 
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RESET 
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Notes: 
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HOLD Asserted 


READY Asserted e HOLD Asserted 


ALWAYS 


READY Asserted « 
HOLD Nagated « 
Request Pending 


READY Negated « 
NA Negated 


T1—First clock of a non-pipelined bus cycle (AM386DE microprocessor drives new address and asserts ADS). 
T2-—-Subsequent clocks of a bus cycle when NA has not been sampled asserted in the current bus cycle. 


TiH-Idle state. 


Th—Hold Acknowledge state (Am386DE microprocessor asserts HLDA). 


The fastest bus cycle consists of two states: T1 and T2. 


Four basic bus states describe bus operation when not using pipelined address. These states do include BS16 usage for 
32-bit and 16-bit bus size. If asserting BS16 requires second 16-bit bus cycle to be performed, it is performed before 


HOLD asserted acknowledged. 
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Figure 42. Bus States (Not Using Pipelined Address) 


acknowledged during T2, the following state willbe T1 of 
the next bus cycle if a bus request is pending internally, 
or Tiifthere is no bus request pending, or Th ifthe HOLD 
input is being asserted. 


The bus state diagram in Figure 42 also applies to the 
use of BS16. If the AmM386DE microprocessor makes 
internal adjustments for 16-bit bus size, the adjustments 
do not affect the external bus states. If an additional 
16-bit bus cycle is required to complete a transfer ona 
16-bit bus, it also follows the state transitions shown in 
Figure 42. 


Use of pipelined address allows the Am386DE CPU to 
enter three additional bus states not shown in Figure 42. 
Figure 48 is the complete bus state diagram, including 
pipelined address cycles. 


Non-Pipelined Address With Dynamic Data Bus 
Sizing 


The physical data bus width for any non-pipelined bus 
cycle can be either 32 or 16 bits. At the beginning of the 
bus cycle, the processor behaves as if the data bus is 
32 bits wide. When the bus cycle is acknowledged by 
asserting READY at the end of a T2 state, the most 
recent sampling of BS16 determines the data bus size 
for the cycle being acknowledged. If BS16 was most 
recently negated, the physical data bus size is detined 
as 32 bits. If BS16 was most recently asserted, the size 
is defined as 16 bits. 


When BS16 is asserted and two 16-bit bus cycles are 
required to complete the transfer, BS16 must be 
asserted during the second cycle; 16-bit bus size is not 
assumed. Like any bus cycle, the second 16-bit cycle 
must be acknowledged by asserting READY. 
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When a second 16-bit bus cycle is required to complete 
the transfer over a 16-bit bus, the addresses generated 
for the two 16-bit bus cycles are closely related to each 
other. The addresses are the same, except BEO and 
BE1 are always negated for the second cycle. This is 
because data on 015—D0 was already transferred dur- 
ing the first 16-bit cycle. 


Figure 43 and Figure 44 show cases where assertion of 
BS16 requires a second 16-bit cycle for complete oper- 
and transfer. Figure 43 illustrates cycles without wait 
states. Figure 44 illustrates cycles with one wait state. In 
Figure 44 Cycle 1, the bus cycle during which BS16 is 
asserted, note that NA must be negated in the T2 
state(s) prior to the last T2 state. This is to allow the rec- 
ognition of BS16 asserted in the final T2 state. The rela- 
tion of NA and BS{6 is given fully in Pipelined Address, 
but Figure 44 illustrates this only precaution you need to 
know when using BS16 with non-pipelined address. 


Pipelined Address 


Address pipelining is the option of requesting the 
address and the bus cycle definition of the next internally 
pending bus cycle before the current bus cycle is 
acknowledged with READY asserted. ADS is asserted 
by the Am386DE microprocessor when the next 
address is issued. The address pipelining option is con- 
trolled on acycle-by-cycle basis with the NA input signal. 


Once a bus cycle is in progress and the current address 
has been valid for at least one entire bus state, the NA 
input is sampled at the end of every phase one until the 
bus cycle is acknowledged. During non-pipelined bus 
cycles, therefore, NAis sampled at the end of phase one 
in every T2. An example is Cycle 2 in Figure 45, during 
which NA is sampied at the end of phase one of every T2 
(it was asserted once during the first T2 and has no fur- 
ther effect during that bus cycle). 


It NA is sampled asserted, the Am386DE microproces- 
sor is free to drive the address and bus cycle definition of 
the next bus cycle, and assert ADS, as soon as it has a 
bus request internally pending. It may drive the next 
address as early as the next bus state, whether the cur- 
rent bus cycle is acknowledged at that time or not. 


Regarding the details of address pipelining, the 
Am386DE CPU has the following characteristics. 


1. For NA to be sampled asserted, BS16 must be 
negated at the sampling window (see Figure 56 
Cycles 2 through 4, and Figure 46 Cycles 1 through 
4). If NA and BS16 are both sampled asserted during 
the last T2 period of a bus cycle, BS16 asserted has 
priority. Therefore, if both are asserted, the current 
bus size is taken to be 16 bits and the next address is 
not pipelined. 

2. The next address may appear as early as the bus 
state after NA was sampled asserted (see Figure 45 
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or Figure 46). In that case, state T2P is entered 
immediately, However, when there is not an internal 
bus request already pending, the next address will 
not be available immediately after NA is asserted 
and T2I is entered instead of T2P (see Figure 47 
Cycie 3). Provided the current bus cycle is not yet 
acknowledged by READY asserted, T2P will be 
entered as soon as the AmM386DE microprocessor 
does drive the next address. External hardware 
should therefore observe the ADS output as 
confirmation the next address is actually being 
driven on the bus. 


3. Once NA is sampled asserted, the Am386DE 
microprocessor commits itself to the highest priority 
bus request that is pending internally. !tcan no fonger 
perform another 16-bit transfer to the same address 
should BS16 be asserted externally, so thereafter 
assumes the current bus size is 32 bits. Therefore, if 
NA is sampled asserted within a bus cycle, BS16 
must be negated thereafter in that bus cycle (see 
Figure 45, Figure 46, Figure 47). Consequently, do 
not assert NA during bus cycles that must have BS16 
driven asserted. See Dynamic Bus Sizing with 
Pipelined Address. 


4. Any address which is validated by a pulse on the 
Am386DE CPU ADS output will remain stable on the 
address pins for at least two processor clock periods. 
The Am386DE microprocessor cannot produce a 
new address more frequently than every two 
processor clock periods (see Figure 45, Figure 46, 
Figure 47). 


5. Only the address and bus cycle definition of the very 
next bus cycle is available. The pipelining capability 
cannot look further than one bus cycle ahead (see 
Figure 47 Cycle 1). 


The complete bus state transition diagram, including 
operation with pipelined address is given by Figure 48. 
Note it is a superset of the diagram for non-pipelined 
address only and the three additional bus states for 
pipelined address are drawn in bold. 


The fastest bus cycle with pipelined address consists of 
just two bus states, T1P and T2P (recall for non-pipe- 
lined address itis T1 and T2). T1P is the first bus state of 
a pipelined cycle. 


initiating and Maintaining Pipelined Address 


Using the state diagram Figure 48, observe the transi- 
tions from an idle state, Ti, to the beginning of apipelined 
bus cycle, T1P. From an idle state Ti, the first bus cycle 
must begin with T1, and is therefore a non-pipelined bus 
cycle. The next bus cycle willbe pipelined, however, pro- 
vided NA is asserted and the first bus cycle ends in a 
T2P state (the address for the next bus cycle is driven 
during T2P). The fastest path from an idle state to a bus 
cycle with pipelined address is shown in below: 
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Figure 43. Asserting BS16 (Zero-Wait-States, Non-Pipelined Address) 
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Figure 44. Asserting BS16 (One-Walt-State, Non-Pipellned Address) 
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Note: 

Following any idle bus state (Ti), addresses are non-pipelined. Within non-pipelined bus cycles, NA is only sampled 
during wait states. Therefore, to begin address pipelining during a group of non-pipelined bus cycles requires a 
non-pipelined cycle with at least one wait state (Cycle 2 above). 
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Figure 45. Transitioning to Pipelined Address During Burst of Bus Cycles 
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Note: 

Following any idle bus state (Ti), the address is always non-pipelined and NA is only sampled during wait states. To 
start address pipelining after an idle state requires a non-pipelined cycle with at least one wait state (Cycle 1 
above). The pipelined cycles (2, 3, 4 above) are shown with various numbers of wait states. 
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Figure 46. Fastest Transition to Pipelined Address Following Idle Bus State 
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Figure 47. Detalls of Address Pipelining During Cycles with Wait States 
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T2I—Subsequent clocks of a bus cycle when NA has been 
sampled asserted in the current bus cycle but there is not yet an 


internal bus request pending (Am386DE CPU will not drive new NA Assertad » 


address or assert ADS). HOLD ae ° 
T2P—Subsequent clocks of a bus cycle when NA has been Request Pending 


sampled asserted in the current bus cycle and there is an internal 

bus request pending (Am386DE CPU drives new address and 

asserts ADS). T1P—First clock of a pipelined bus cycle. READY Asserted 
TiHIdle state. 

Th—Hold Acknowledge state (Am386DE CPU asserts HLDA). 

Asserting NA for pipelined address gives access to three more bus VW 


states: T2l, T2P, and T1P. HEADY Negated 


Using pipelined address, the fastest bus cycle consists of T1P and 
T2P. 
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Figure 48. Am386DE Microprocessor Complete Bus States (Including Pipelined Address) 
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Ti, Ti, Ti T1-T2-T2P T1P-T2P 
a a ee 
Idle Non-Pipelined Pipelined 
States Cycle ycle 


T1-T2-T2P are the states of the bus cycle that esta- 
blishes address pipelining for the next bus cycle, which 
begins with T1P. The same is true after a bus hold state, 
shown below: 


Th, Th, Th T1-T2-T2P TiP-T2P 
| et — 
Hold Non-Pipelined Pipemied 
Acknowledge Cycle ycle 
States 


The transition to pipelined address is shown functionally 
by Figure 46 Cycle 1. Note that Cycle 1 is used to transi- 
tion into pipelined address timing for the subsequent 
Cycles 2, 3, and 4 that are pipelined. The NA input is 
asserted at the appropriate time to select address 
pipelining for Cycles 2, 3, and 4. 


Once a bus cycle is in progress and the current address 
has become valid, the NA input is sampled at the end of 
every phase one, beginning with the next bus state, until 
the bus cycle is acknowledged. During Figure 46 Cycle 
1 therefore, sampling begins in T2. Once NA is sampled 
asserted during the current cycle, the Am386DE micro- 
processor is free to drive a new address and bus cycle 
definition on the bus as early as the next bus state. In 
Figure 45 Cycle 1 for example, the next address is 
driven during state T2P. Thus, Cycle 1 makes the transi- 
tion to pipelined address timing, since it begins with T1 
but ends with T2P. Because the address for Cycle 2 is 
available before Cycle 2 begins, Cycle 2 is called a pipe- 
lined bus cycle, and it begins with T1P. Cycle 2 begins as 
soon as READY asserted terminates Cycle 1. 


Example transition bus cycles are Figure 46 Cycle 1 and 
Figure 45 Cycle 2. Figure 46 shows transition during the 
very first cycle after an idle bus state, which is the fastest 
possible transition into address pipelining. Figure 45 
Cycle 2, shows a transition cycle occurring during a 
burst of bus cycles. In any case, a transition cycle is the 
same whenever it occurs: it consists at least of T1, T2 
(you assert NA at that time), and T2P (provided the 
Am386DE microprocessor has an internal bus request 
already pending, which it almost always has). T2P 
States are repeated if wait states are added to the cycle. 


Note three states (T1, T2, and T2P) are only required in 
a bus cycle performing a transition from non-pipelined 
address into pipelined address timing; for example, 
Figure 46 Cycle 1. Figure 46 Cycles 2, 3, and 4 show 
that address pipelining can be maintained with two-state 
bus cycles consisting only of T1P and T2P. 
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Once a pipelined bus cycle is in progress, pipelined tim- 
ing is maintained for the next cycle by asserting NA and 
detecting that the Am386DE CPU enters T2P during the 
current bus cycle. The current bus cycle must end in 
state T2P for pipelining to be maintained in the next 
cycle. T2P is identified by the assertion of ADS. 
Figure 45 and Figure 46 however, show pipelining end- 
ing after Cycle 4, because Cycle 4 ends in T2P. This indi- 
cates the AmM386DE CPU did not have an internal bus 
request prior to the acknowledgment of Cycle 4. If a 
cycle ends with a T2 or T2!, the next cycle will not be 
pipelined. 


Realistically, address pipelining is almost always main- 
tained as long as NA is sampled asserted. This is so, 
because inthe absence of any other request a code pre- 
fetch request is always internally pending until the 
instruction decoder and code prefetch queue are com- 
pletely full. Therefore, address pipelining is maintained 
for long bursts of bus cycles, if the bus is available (i.e., 
HOLD negated) and NA is sampled asserted in each of 
the bus cycles. 


Pipelined Address With Dynamic Data Bus Sizing 


The BS16 feature allows easy interface to 16-bit data 
buses. When asserted, the Am386DE microprocessor 
bus interface hardware performs appropriate action to 
make the transfer using a 16-bit data bus connected on 
D15-D0. 


There is a degree of interaction, however, between the 
use of Address Pipelining and the use of Bus Size 16. 
The interaction results from the multiple bus cycles 
required when transferring 32-bit operands over a 16-bit 
bus. If the operand requires both 16-bit halves of the 
32-bit bus, the appropriate AM386DE microprocessor 
action is a second bus cycle to complete the operand’s 
transfer. This necessity conflicts with NA usage. 


When NA is sampled asserted, the Am386DE micropro- 
cessor commits itself to perform the next internally 
pending bus request, and is allowed to drive the next 
internally pending address onto the bus. Asserting NA 
therefore makes it impossible for the next bus cycle to 
again access the current address on A31—-A2, such as 
may be required when BS16 is asserted by the external 
hardware. 


To avoid conflict, the Am386DE microprocessor is 
designed with the following two provisions. 


1. To avoid conflict, BS16 must be negated in the current 
bus cycle if NA has already been sampled asserted 
in the current cycle. If NA is sampled asserted, the 
current data bus size is assumed to be 32 bits. 

2. Also to avoid conflict, if NA and BS16 are both 
asserted during the same sampling window, BS16 
asserted has priority and the Am386DE_ micro- 
processor acts as if NA was negated at that time. 
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Certain types of 16- or 8-bit operands require no adjust- 
ment for correct transter on a 16-bit bus. Those are read 
or write operands using only the lower halt of the data 
bus, and write operands using only the upper half of the 
bus, since the Am386DE CPU simultaneously dupli- 
cates the write data on the lower half of the data bus. For 
these patterns of Byte Enables and the W/R signals, 
BS 16 need not be asserted at the AM386DE CPU allow- 
ing NA to be asserted during the bus cycle if desired. 


Interrupt Acknowledge (INTA) Cycies 


In response to an interrupt request on the INTR input 
when interrupts are enabled, the Am386DE micropro- 
cessor performs two interrupt acknowledge cycles. 
These bus cycles are similar to read cycles in that bus 
definition signals define the type of bus activity taking 
place, and each cycle continues until acknowledged by 
READY sampled asserted. 


The state of A2 distinguishes the first and second inter- 
rupt acknowledge cycles. The byte address driven dur- 
ing the first interrupt acknowledge cycle is 4 (A31—A3 
Low, A2 High, BE3-BE1 High, and BEO Low). The 
address driven during the second interrupt acknowl- 
edge cycle is 0 (A31-A2 Low, BE3-BE1 High, BEO 
Low). 


The LOCK output is asserted from the beginning of the 
first interrupt acknowledge cycle until the end of the 
second interrupt acknowledge cycle. Four idie bus 
States, Ti, are inserted by the Am386DE microprocessor 
between the two interrupt acknowledge cycles, allowing 
forcompatibility with spec TRHRL of the 82594 Interrupt 
Controller. 


During both interrupt acknowledge cycles, D31—D0 
float. No data is read at the end of the first interrupt 
acknowledge cycle. At the end of the second interrupt 
acknowledge cycle, the Am386DE microprocessor will 
read an external interrupt vector from D7—D0 of the data 
bus. The vector indicates the specific interrupt number 
(from 0-255) requiring service. 


Halt Indication Cycle 


The Am386DE microprocessor halts as a result of 
executing a HALT instruction. Signaling its entrance into 
the halt state, a halt indication cycle is performed. The 
halt indication cycle is identified by the state of the bus 
definition signals shown in Bus cycle Definition and a 
byte address of 2. BEO and BE2 are the only signals dis- 
tinguishing halt indication from shutdown indication, that 
drives an address of 0. During the halt cycle undefined 
data is driven on D31-D0. The halt indication cycle must 
be acknowledged by READY asserted. 


Ahalted Am386DE CPU resumes executionwhen INTR 
(if interrupts are enabled) or NMI or RESET is asserted. 
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Shutdown indication Cycie 


The Am386DE microprocessor shuts down as a result 
of a protection fault while attempting to process a double 
fault. Signaling its entrance into the shutdown state, a 
shutdown indication cycle is performed. The shutdown 
indication cycle is identified by the state of the bus defini- 
tion signals shown in Bus Cycle Definition and a byte 
address of 0. BEO and BE2 are the only signals distin- 
Quishing shutdown indication from halt indication, which 
drives an address of 2. During the shutdown 
cycle, undefined data is driven on D31-D0. The shut- 
down indication cycle must be acknowledged by 
READY asserted. 


Ashutdown Am386DE microprocessor resumes execu- 
tion when NMI or RESET is asserted. 


Other Functional Descriptions 


Entering and Exiting Hold Acknowledge 


The Bus Hold Acknowledge State, Th, is entered in 
response to the HOLD input being asserted. In the Bus 
Hold Acknowledge state, the Am386DE microprocessor 
floats all output or bidirectional signals, except for 
HLDA. HLDA is asserted as long as the AmM386DE CPU 
remains in the bus hold acknowledge state. In the Bus 
Hold Acknowledge state, all inputs except HOLD, FLT, 
RESET, BUSY, ERROR, and PEREQ are ignored (also 
up to one rising edge on NMI is remembered for proces- 
sing when HOLD is no longer asserted). 


Th may be entered from a bus idle state, as in Figure 53, 
or after the acknowledgment of the current physical bus 
cycle if the LOCK signal is not asserted, as in Figure 54 
and Figure 55. If HOLD is asserted during a flocked bus 
cycle, the Am386DE microprocessor may execute one 
unlocked bus cycle before acknowledging HOLD. If 
asserting BS16 requires a second 16-bit bus cycle to 
complete a physical operand transfer, it is performed 
before HOLD is acknowledged, although the bus state 
diagrams in Figure 42 and Figure 48 do not indicate 
that detail. 


Th is exited in response to the HOLD input being 
negated. The following state will be Ti as in Figure 53 if 
no bus request is pending. The following bus state will 
be T1 if a bus request is internally pending, as in 
Figure 54 and Figure 55. 


This also exited in response to RESET being asserted. 


If a rising edge occurs on the edge-triggered NMI input 
while in Th, the event is remembered as a non-mask- 
able interrupt 2 and is serviced when This exited, unless 
of course, the AM386DE microprocessor is reset before 
Th is exited. 


RESET During HOLD Acknowledge 


RESET being asserted takes priority over HOLD being 
asserted. Therefore, Th is exited in response to the 
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Atransfer requiring two 
cycles on 16-bit data bus 


Previous Cycle 1 Cycle 1A Cycle 2 
Cycle Pipelined Non-Pipelined Non-Pipelined 
(Write Write) (Read) 
Part One Part Two 


CLK2 [ 
(CLK) [ 
BET_-BEO [ 
BE3-BE2, 
A31-A2, [ 
M/O, DIG 
wh [ 
ADSL 
Note: NA must be negated in these Ts to allow 
recognition of asserted BS16 in final T2s. 
na [ XK Dont CareXXIY XIX Don't Care XX 
32-Bit 
Bus Size 
esr [| KXXXKXXXD | AKXXXIKXMD | AKXXXDY 
READY | en Ox | AXXXXY 


coor [ [KF vas XK vaia2 
o-oo [ Cin) (a yf St 


o31-p16 [_ 


Notes: 
Cycle 1 is pipelined. Cycle 1A cannot be pipelined, but its address can be inferred from that 
of Cycle 1, to externally simulate address pipelining during Cycle 1A. 


Key. Dn -= Physical data pin n 
dn = Logical data pinn 


Figure 49. Using NA and BS16 
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Previous Interrupt Idle Interrupt Idle 
Cycle Acknowledge (4 Bus States) Acknowledge 
— 1 Cycle 2 
T2 


CLK2 [ 


asl 


3BEIL XKXXXY 


RX KKK KKK ea at all XXX napa iia tad 
pxna a XXXXD lo AXXNKXXX 


BEO, © 
A31-A3, [ XXX 


M/d, D/C, 

ic a ele ales 
as ee XXX) AXXAKXXX 
= Seegtee 


<i acm 


NAL i X XK XX ie i KIX KX KIX) v, X wUIKX KX XX “XX ima rl fo vXX ieee XS Zz X 


a2 [ XXXXXD 


tock, XXXXX 


‘an 
ian 


Bsié [ ooo CAXKX ignored MXKX = MKXAMKKAAKK emia bed XXX 
Y 


renoy[ XXXXX) =e ei yes XXXKXXXKXXKKKY | - 


Ignored Vector 
o7-00[ J---+} --F --4-{=)-—}---F --} --4f---- -- 4 -{- 

Ignored \gnored 
p31-p8 [_ SS a as EN Dee een (acre) bee SO 


Note: 

Interrupt Vector (0-255) is read on D7—D0 at end of second Interrupt Acknowledge bus cycle. 
Because each Interrupt Acknowledge bus cycle is followed by idle bus states, asserting NA has no 
practical effect. Choose the approach that is simplest for your system hardware design. 


Figure 50. Interrupt Acknowledge Cycles 18421A-053 
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Cycle 1 Cycle 1A Idle 
Non-Pipelined Non-Pipetined 
(Write) (Halt) 
cike [ 
(CLK) [ 
aes Am386DE CPU remains 
BES, BET, E halted until INTR, NMI, or 
BEO, M10 


RESET is asserted. 


| | 
Am386DE CPU responds to 
HOLD input while in the Halt 
state. 


BE2, A31-A2, [ 
WIR, D/C 


<I 
> 


> 
2 


a 
12 

rq 
Lit ah 


NAL XXXKXKXXXKXXKKXKNKKX DKK DOK 
ssi [ jem ps XK sree XXX NXXKMAX KAD XIAN 
READY [ | £5 KX - mm | 6 AXXXNKKANKKAM 
Note: 
Halt cycle must be acknowledged by 
READY asserted. Wait states may be 
added to the cycle if desired. 
tock [ JK valid / XXXAKXXAXXKAKXKM 


D31-Do [ Out 


Figure 51. Halt Indication Cycle 18421A-054 
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CLK2 


(CLK) 


BE3-BET, 
MAO, W/R 


BEO, 
A31-A2, D/C 


ADS 


D31—Do 
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Cycle 1 Cycle 2 Idle 
Pipelined Pipelined 
(Read) (Shutdown) 


m7. ™ 


Am386DE CPU remains 
shutdown until NMI or 
RESET is asserted. 


Am386DE CPU responds 
to HOLD input while in the 
Shutdown state. 


[ XX 


| 


[ AKXXXKXXMAXKAKKXA KAKA 


XAAXXAKXXAKXXXKXKN 


| om RXXXXXXXLXXN 


Note: 

Shutdown cycle must be acknowledged 
by READY asserted. Wait states may be 
added to the cycle if desired. 


if 
af 


DX 


C | AX | AXXKKD 


XX AXXNAXXAAXA 


pe valet | 


[ Valid 1 7 


(Floating) 


| 


Figure 52. Shutdown Indication Cycle 


Undefined 
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Idle 


CLK2 ff 

(CLK) [ 

HOLD [ 

HLDA [ 
BES-BEO, 
A31-A2, M/O, 
D/G, WA 

nos [ 


XXX 


psi-po [ 


Note: 


Hold 
Acknowledge 


Vara, V, 


XXXKXXXKXXAKKKN 


—-—-— (Floating) -—— 


Idle 


For maximum design flexibility, the Am386DE CPU has no internal pullup resistors an its outputs. The design may 
require an external pullup on ADS and other Am386DE CPU outputs to keep them negated during float periods. 


18421 A-056 


Figure 53. Requesting Hold from Idle Bus 


RESET input being asserted. If RESET is asserted while 
HOLD remains asserted, the AM386DE microprocessor 
drives its pins to defined states during reset, as in Table 
15, Pin State During RESET, and performs internal reset 
activity as usual. 


If HOLD remains asserted when RESET is negated, the 
Am386DE microprocessor enters the hold acknowledge 


state before performing its first bus cycle, provided 
HOLD is still asserted when the Am386DE microproces- 
sor would otherwise perform its first bus cycle. If HOLD 
remains asserted when RESET is negated, the BUSY 
input is still sampled as usual to determine whether a 
self test is being requested, and ERROR is still sampled 
as usual to determine whether a 387DX math coproces- 
sor versus an 80287 (or none) is present. 
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Float 


Activating the FLT input floats all Am386DE CPU 
bidirectional and output signals, including HLDA. 
Asserting FLT isolates the Am386DE CPU from the sur- 
rounding circuitry. 


As the Am386DE microprocessor is packaged in a sur- 
face mount PQFP, it cannot be removed from the 
motherboard when In-Circuit Emulation (ICE) is 
needed. The FLT input allows the AmM386DE CPU to be 
electrically isolated from the surrounding circuitry. This 
allows connection of an emulator to the Am386DE 
microprocessor PQFP without removing it from the 
PCB. This method of emulation is referred to as ON- 
Circuit Emulation (ONCE). 


Entering and Exiting Float 


FLT is an asynchronous, active Low input. It is recog- 
nized on the rising edge of CLK2. When recognized, it 
aborts the current bus cycle and floats the outputs of the 
Am386DE microprocessor (Figure 57). FLT must be 
held Low for a minimum of 16 CLK2 cycles. Reset 
should be asserted and held asserted until after FLT is 
deasserted. This will ensure that the Am386DE CPU will 
exit Float in a valid state. 


Asserting the FLT input unconditionally aborts the cur- 
rent bus cycle and forces the Am386DE microprocessor 
into the Float mode. Since activating FLT uncondition- 
ally forces the Am386DE CPU into Float mode, the 
Am386DE CPU is not guaranteed to enter Float in a 
valid state. After deactivating FLT, the AmM386DE CPU is 
not guaranteed to exit Float mode in a valid state. This is 
not a problem, as the FLT pin is meant to be used only 
during ONCE. After exiting Float, the Am386DE CPU 
must be reset to return it to a valid state. Reset should be 
asserted before FLT is deasserted. This will ensure that 
the Am386DE CPU will exit Float in a valid state. 


FLT has an internal pull-up resistor, and if itis not used it 
should be unconnected. 


Bus Activity During and Following Reset 


RESET is the highest priority input signal capable of 
interrupting any processor activity when it is asserted. A 
bus cycle in progress can be aborted at any stage; or idle 
states or bus hold acknowledge states discontinued so 
that the RESET state is established. 


RESET should remain asserted for at least 15 CLK2 
periods to ensure it is recognized throughout the 
Am386DE microprocessor, and at least 80 CLK2 peri- 
ods if AM386DE device self-test is going to be requested 
at the falling edge. RESET asserted pulses less than 15 
CLK2 periods may not be recognized. RESET pulses 
less than 80 CLK2 periods followed by a self-test may 
cause the self-test to report a failure when no true failure 
exists. 


The additional RESET pulse width is required to clear 
additional state prior to valid self-test. 


Provided the RESET falling edge meets setup and hold 
times, t25 and t26, the intemal processor clock phase is 
defined at that time, as illustrated by Figure 67. 


An Am386DE microprocessor self-test may be 
requested at the time RESET is negated by having the 
BUSY input at a Low level, as shown in Figure 56. The 
self-test requires (22°) + approximately 60 CLK2 periods 
to complete. The self-test duration is not affected by the 
test results. Even if the self-test indicates a problem, the 
Am386DE device attempts to proceed with the reset 
sequence afterward. 
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After the RESET falling edge (and after the self-test if it 
was requested) the Am386DE microprocessor per- 
forms an internal initialization sequence for approxi- 
mately 350 to 450 CLK2 periods. 


The Am386DE microprocessor samples its ERROR 
input some time after the falling edge of RESET and 
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sampling period BUSY must be High. If ERROR was 
sampled active, the Am386DE device employs the 
32-bit protocol of a 387DX math coprocessor. Even 
though this protocol was selected, it is still necessary to 
use a software recognition test to determine the pres- 
ence or identity of the coprocessor and to assure com- 
patibility with future processors. 


before executing the first ESC instruction. During this 


32-Bit Bus ow 


Cycle 1 Hold Cycle 2 
Non-Pipelined Acknowledge Non-Pipelined 
(Read) (Write) 
T1 T2 
cik2 [ 
HoLD [ 
HOLD asserted no later 
than READY asserted 
HLDA [ 
we wemes [bc de eee — 
31-82, Dewi L Cage cee Valid 2 
(Floating) 
pi: a ice sens fp 
TKO XD POPPI xX POO - x 


BOSODOOX XXX SOOO = 


Note: if asserting a 6 requires a second bus cycle 
to be performed, the second cycle is 
performed before Hold Acknowledge. 


XK KKK XX 
(Negated, or Last Locked Cycle) 


Vs TT ee 


(Floating) 


iE 


Note: 
HOLD is a synchronous input and can be asserted at any CLK2 edge, provided setup and hold (123 and 124) requirements 
are met. This waveform is useful for determining Hold Acknowledge latency. 


Figure 54. Requesting Hold from Active Bus (NA Negated) 18421A-057 
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Cycle 1 Hold Cycle 2 
Pipelined Acknowledge Non-Pipelined 
(Write) (Read) 


CLK2 
(CLK) 


HOLD 
HOLD asserted in same 
bus state as NA asserted 


HLDA 
MiO, BES-BES, ; 
A31-A2, DE, WAR [ KKXKXKKAKXMD 


ADS (Floating) 


: OK KADEKK AIK KAKA KKMRIAVRK KIT KM 


RDOQOOOPOODOPODDD ADOT 


XOXOXO SON AO OA OOO OOO AON 
(Negated or Last Locked Cycle) 


a 
a4; 
oe Se sae el 


Valid 2 


D31-D00 


Note: 
HOLD is a synchronous input and can be asserted at any CLK2 edge, provided setup and hold (t23 and t24) 
requirements are met. This waveform is useful for determining Hold Acknowledge latency. 


Figure 55. Requesting Hold from Active Bus (NA Asserted) 18421 A-058 
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+—— Reset ———»|«————___Inlernal ___ Cycle 1 


Initialization 


2 15 CLK2 duration if not going It self-test is performed Non-Pipelined 


to request self-test. 20 « Read 
2 80 CLK2 duration before ( sed Ag eae 


requesting self-test. 


1 2 3 1]17 18 419 395 396 397 398 
cLk2 [ 
Reset [ 
Jo2|o1|o2}}o1]o2 
Ee OO Odo © @ Be, 
he end Negated to allow sensing of a 
No setf-test 387DX math ci 


rocessor 
susy [ XXXKXXXKXXD XY T It ROKK 


Low to begin self-test (Note 2) Asserted to indicate 387DX 
math coprocessor protocol 


$ 
ERROR L XXXXXXXXKXXAKXXXXXKXXAXXKAXADKAK | IL KXKKXX 


BES-BES. W/ Up to 30 CLK2 fl 
M/IO, HLDA [ XXXKXKX A Low ]] During Reset AXXXX i X X XO MX xX Valid 1 


= pias 
pete Up to 30 CLK2 aaa dle al 


LOGK [ XXXXKKKY High}] During Reset XXXXXM XXX x XIX Valid 1 
(proa0 cue ai ae 


ADS XXXXXXK| Highl] During Reset 


NA L XXXXXXXXXKMKXXXXXXXXKXAAKAAD AE Ka XXXKX 
eS aeae| ice 
este L XXXXXXXXXXAKXXXXXXXXAXAXKKADKXKXILA DRAKA 
ADY [ XXKX KXXKAK MOKA KAKA KAKA DK ADX AX XXX XX 
par-po [ XXXXXKX)---- ff oo--- (Floating) ---------]]----- -ff--p----4-- 
Notes: 
“Approximately 


1. BUSY should be held stable for 8 CLK2 periods before and after the CLK2 period in which RESET falling edge occurs. 
2. If self-test is requested, the Am386DE microprocessor outputs remain in their reset state as shown here and in Table 14. 


Figure 56. Bus Activity from Reset Until First Code Fetch eis ates 


2 i Aw AS AU AW A A AW AU AU AU AC AT AU AU AU AC AC AC AU AC AC Ae 
Convel [ XWald ps2 22sec eeeceeeeeeeeeeceees x 
oe ee © ND ee — 
pares (YX Va vos vee ece ence eee eee cece a a 
Reset [ ee Oe 


Figure 57. Entering and Exiting FLT 18421A-060 
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Self-Test Signature 


Upon completion of self-test, (if self-test was requested 
by holding BUSY Low atleast eight CLK2 periods before 
and after the falling edge of RESET), the EAX register 
will contain a signature of 00000000h indicating the 
Am386DE CPU passed its self-test of microcode and 
major PLA contents with no problems detected. The 
passing signature in EAX, 00000000h, applies to all 
Am386DE microprocessor revision levels. Any non- 
zero signature indicates the Am386DE CPU unit is 
faulty. 


Component and Revision Identifiers 


To assist AM386DE microprocessor users, the micro- 
processor after reset holds a component identifier anda 
revision identifier in its DX register. The Am386DE 
microprocessor, after reset, holds a component identi- 
fier and revision identifier in its DX register. In the DH 
portion, the lower nibble 3H refers to the Am386 CPU 
architecture. In the lower DL portion, the upper nibble 
EH identifies the processor as a member of the Am386 E 
CPU series. The lower nibble holds the unsigned binary 
number related to the component revision level. The 
revision identifier begins chronologically with a value 
zero and is subject to change (typically it will be increm- 
ented) with component steppings intended to have cer- 
tain improvements or distinctions from previous 
steppings. 


These features are intended to assist AM386DE micro- 
processor users to a practical extent. However, the revi- 
sion identifier value is not guaranteed to change with 
every stepping revision nor to follow a completely uni- 
form numerical sequence, depending on the type or 
intention of revision or manufacturing materials required 
to be changed. 


Coprocessor Interfacing 


The Am386DE microprocessor provides an automatic 
interface for coprocessor, like a387DX compatible float- 
ing-point math coprocessor. A 387DX math coproces- 
sor uses an l/O-mapped interface driven automatically 
by the Am386DE microprocessor and assisted by three 
dedicated signals: BUSY, ERROR, and PEREQ. 


As the AmM386DE CPU begins supporting a coprocessor 
instruction, it tests the BUSY and ERROR signals to 
determine if the coprocessor can accept its next instruc- 
tion. Thus, the BUSY and ERROR inputs eliminate the 
need for any preamble bus cycles for communication 
between processor and coprocessor. A 387DX math 
coprocessor can be given its command op-code 
immediately. The dedicated signals provide instruction 
synchronization, and eliminate the need of using the 
Am386DE CPU WAIT op-code (9Bh) for 387DX math 
coprocessor instruction synchronization (the WAIT op- 
code was required when 8086 or 8088 was used with the 
8087 coprocessor). 
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Custom coprocessors can be included in Am386DE 
microprocessor-based systems, via memory-mapped 
or /O-mapped interfaces. Such coprocessor interfaces 
allow a completely custom protocol, and are not limited 
to a set of coprocessor protocol primitives. Instead, 
memory-mapped or !/O-mapped interfaces may use all 
applicable Am386DE microprocessor instructions for 
high-speed coprocessor communication. The BUSY 
and ERROR inputs of the Am386DE CPU may also be 
used for the custom coprocessor interface, if such hard- 
ware assist is desired. These signals can be tested by 
the Am386DE CPU WAIT op-code (9Bh). The WAIT 
instruction will wait until the BUSY input is negated 
{interruptable by an NMI or enable INTR input), but gen- 
erates an Exception 16 fault if the ERROR pin is in the 
asserted state when the BUSY goes (or is) negated. If 
the custom coprocessor interface is memory-mapped, 
protection of the addresses used forthe interface canbe 
provided with the Am386DE microprocessor on-chip 
segmentation mechanisms. If the custom interface is 
\/O-mapped, protection of the interface can be provided 
with the Am386DE microprocessor IOPL (I/O Privilege 
Level) mechanism. 


A 387DX math coprocessor interface is I/O mapped as 
shown in Table 23. Note that a 387DX math coproces- 
sor interface addresses are beyond the OhFFFFh range 
for programmed VO. When the Am386DE CPU sup- 
ports a 387DX math coprocessor, the AmM386DE micro- 
processor automatically generates bus cycles to the 
coprocessor interface addresses. 


Table 23. Math Coprocessor Port Addresses 


Address in Am386DE 387DX 
CPU VO Space 


Coprocessor Register 


Opcode Register 
800000F8h (32-bit port) 


To correctly map the 387DX math coprocessor registers 
to the appropriate I/O addresses, connect a 387DX 
math coprocessor CMDO0 pin directly to the A2 output of 
the Am386DE microprocessor. 


Operand Register 
(32-bit port) 


Software Testing for Coprocessor Presence 


When software is used to test for coprocessor (387DX) 
presence, it should use only the following coprocessor 
op-codes: FINIT, FNINIT, FSTCW mem, FSTSW mem, 
FSTSW AX. To use other coprocessor op-codes whena 
coprocessor is known to be not present, first set EM = 1 
in AM386DE microprocessor CRO. 
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ABSOLUTE MAXIMUM RATINGS OPERATING RANGES 


Storage Temperature ............ 65°C to +150°C Supply Voltage with Respect to Vs, ... 0.5 Vto +7 V 
Ambient Temperature Under Bias . —65°C to +125°C Voltage on Other Pins.......... -0.5 V to Veo +0.5 V 


Stresses above those listed may cause permanent device Operating ranges define those limits betwean which the func- 
failure. Functionality at or above these limits is not implied. tionality of the device is guaranteed 


Exposure to ABSOLUTE MAXIMUM RATINGS for extended 
periods may affect device reliability. 


DC CHARACTERISTICS over COMMERCIAL Operating Ranges (PQFP, 33 MHz) 
Voc = 3.0 V to 3.6 V; Tease = 0°C to +100°C (Extended Temperature, Toase = 40°C to 100°C) 


Input Low Voltage 


Output Low Voltage 
lo, = 0.5 mA: A31-A2, D31-Do 
lo, = 0.5 mA: BE3-BEO, W/R, D/C, M/10, 
LOCK, ADS, HLDA 
lo. =2mA: A31-A2, 031-Do 
lo, = 2.5 mA: BES-BEO, W/R, D/C, M/0, 
COCK, ADS, HLDA 
Output High Voltage 
loy = 0.1 mA: A31-A2, D31-Do 
loy = 0.1 mA: BE3-BEO, W/R, 
D/C, M/A, LOCK, 
ADS, HLDA 
low = 0.5 mA: A31-A2, D31-DO 
low = 0.5 mA: BE3-BEO, 
WIR, D/C, MAO, COCK, 
ADS, HLDA 


Input Leakage Current OVS Vins Voc 
(All pins except BS16, PEREQ, 

BUSY, FLT, and ERROR) 

Input Leakage Current Vin =Vcc — 0.1 V 
(PEREQ Pin) Vin =2.4 V (Note 2) 
Input Leakage Current Vi =0.1V 


(BS16, BUSY, FLT, and ERROR) Vii = 0.45 V (Note 3) 


Output Leakage Current 0.1 VS Vout § Voc 


Supply Current (Note 6) Voc = 3.3. V 
CLK2 = 50 MHz: Oper. Freq. 25 MHz loc Typ = 95 
CLK2 = 66 MHz: Oper. Freq. 33 MHz Icc Type = TBD 


Notes: 

1. The Min value, -0.3, is not 100% tested. 

2, PEREQ input has an internal pull-down resistor. 

3. BS16, BUSY, FLT, and ERROR inputs each have an internal pull-up resistor. 
4. Not 100% tested. 

5, Outputs are CMOS and will pull rail-to-rail if the load is not resistive. 

6. Inputs are at either Voc or Ves. 
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ABSOLUTE MAXIMUM RATINGS 
Storage Temperature ............ -65°C to +150°C 
Ambient Temperature Under Bias . —65°C to +125°C 


Stresses above those listed may cause permanent device 
failure. Functionality at or above these limits is not implied. 
Exposure to ABSOLUTE MAXIMUM RATINGS for extended 
periods may affect device reliability. 


OPERATING RANGES 


Supply Voltage with Respect 
tO. Vissi 5 Sonos Daas Gag aie deed oo O0.5Vto+7V 
Voltage on Other Pins .......... 0.5 V to Veg 40.5 V 


Operating ranges define those limits between which the func- 
tionality of the device is guaranteed. 


DC CHARACTERISTICS over COMMERCIAL Operating Ranges (PQFP, 33 MHz) 
Voc = 3.0 V to 5.5 V; Tease = 0°C to +100°C (Extended Temperature, Toase = 40°C to 100°C) 


Symbol 
Vii 


Vou Output High Voltage 
lon = 1mA: A31-A2, 031-—DoO 
lon = 0.9 mA: BE3—-BEO,W/R, 
D/C, M/id, LOCK, 
HLDA 


lu Input Leakage Current 
(All pins except BS16, PEREQ, 


BUSY, FLT, 
and ERROR) 


ry Input Leakage Current Vn =2.4V 
(PEREQ Pin) (Note 2) 


le Input Leakage Current 
(BS16, BUSY, FLT, and ERROR) 


Parameter Description lNotes si 


input Low votiage 
Vic CLK2 Input Low Voltage (Note 1) -0.3 


Vince CLK2 Input High Voltage (33 MHz) 2.7 
Vot Output Low Voltage (Note 5) 
lo. = 4mA: A31-A2, D31-Do 
lo. =5 mA: BE3-BEO, WA, D/C, 
MAO, COCK, ADS, 
HLDA 


OVs Vw S$ Voc 


Vii = 0.45 
(Note 3) 


Preliminary 


nm 
oO 


Nn 
> 


- 
an 
<< 


+15 A 


200 pA 


—400 pA 


ine} 
ow 
i 


lo Output Leakage Current +15 pA 
lec Supply Current (Note 6) Voc = 5.0 V 
CLK2 = 50 MHz: Oper. Freq. 25 MHz } Icc Typ = 160 mA 
CLK2 = 66 MHz: Oper. Freq. 33 MHz ] Icc Typ = 210 mA 
Ow Ce 
Cout pF 


Notes: 
1. The Min value, ~0.3, is not 100% tested. 
2. PEREQ input has an internal pull-down resistor. 


3. BS16, BUSY, FLT, and ERROR inputs each have an internal pull-up resistor. 


4, Not 100% tested. 


5. Outputs are CMOS and will pull rail-to-rail if the load is not resistive. 


6. Inputs are at either Voc or Vss. 
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SWITCHING CHARACTERISTICS over COMMERCIAL operating range — 33 MHz 
Voc = 3.0-5.5 V; Tcase = O°C to +100°C (PQFP) 
(Extended Temperature Tcase = —40°C to 100°C) 


ae 
| Oper. Frequency: Amasede CPU | Hall cLK2irea. | 
Peuke Poros: —Amasebe cru |_| 
2 
3 
[4 | cuk2raliTime | 2.7 V-0.8V (Noto 3) | 
A31-A2 Valid Delay 62 
A31~A2 Float Delay 
BES-BEO, LOGK Valid Delay 62 
3-~BE0, LOCK Float Delay 
/Fi, M/O, D/C Valid Delay 62 
AOS 

/R, MIO, D/C, ADS Float Delay 
31—DO Write Data Valid Delay 63 
31-—DO Write Data Hold Time C, = 50 pF 

31-DO Float Delay 
LDA Valid Delay 62, 69 
LDA Float Delay 


NA Setup Time 
NA Hold Time 


Ref Figure 


De) 
is} 
3 = 
N 


33.3 


_ 


5 


i 


w mToeAlolo 
» 

Z{ZIZ/gigl=lal= 
a 

= 

a 

i) 

o 

*) 

< 
NT} 
wolo 


~ 
~N 


— 
Ls) 
o 


ns 


are 
eS 
Ls) 
oO 


= 
als 


=" 
n 


BS16 Setup Time 
BS16 Hold Time 
READY Setup Time 


READY Hold Time caren 


61 
61 


= = 
© ~ 


Lo Rae) 
oO 


nN} m% np rM 
NN] oO wy hy 


1 31—DO Read Setup Time 
31—D0 Read Hold Time 
OLD Setup Time 

24 HOLD Hold Time 


5 


3)> 
Ola 


ns 


61 
70 
70 


nh 


RESET Setup Time 
RESET Hold Time 


MI, INTR Setup Time 


( 
9 | PEREQ, ERROR, BUSY, (Note 2) 61 
FLT* Setup Time 
30 PEREQ, ERROR, BUSY, (Note 2) 
FLT* Hold Time 


Notes: 

*PQFP package only. 

1. Float condition occurs when maximum output current becomes less than io in magnitude. Float delay is not 100% tested. 

2. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, 
to assure recognition within a specific clock period. 

3. Rise and fall times are not tested. 

4. Min time not 100% tested. 


3),3 [3 
a a ew 


EF 
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SWITCHING WAVEFORMS 


The switching characteristics consist of output delays, 
input setup requirements, and input hold requirements. 
All characteristics are relative to the CLK2 rising edge 
crossing the 2.0 V level. 


Switching characteristic measurement is defined by 
Figure 58. Inputs must be driven to the voltage levels 
indicated by this diagram. AM386DE CPU output delays 
are specified with minimum and maximum limits mea- 
sured as shown. The minimum Am386DE microproces- 
sor delay times are hold times provided to extemal 
circuitry. Am386DE microprocessor input setup and 
hold time are specified as minimums, defining the 


ol 
cane | 2V oe NN es 
® 
A31-A2, ADS, D/G = Min 
—A2, ADS, ' ” Ty 
TOOK, MIO, WiFi Vaid yoy SYP sy Valid 
BES_BEO, HLDA Output n ©? " RS K.° ¥ Output n+1 


D31-Do [ 


NA, BS16, 
INTR, NMI 


READY, HOLD, FLT, 
ERROR, BUSY, 
PEREQ, D31-Do 


Legend A—Maximum Output Delay Spec 
B—Minimum Output Delay Spec 


C—Minimum Input Setup Spec 
D—Minimum Input Hold Spec 


Note: input waveforms have tr< 2.0 ns from 0.8 V to 2.0 V. 


PRELIMINARY 


smallest acceptable sampling window. Within the sam- 
pling window, a synchronous input signal must be stable 
for correct Am386DE microprocessor operation. 


Outputs ADS, W/R, D/C, MAO, LOCK, BE3-BEO, 
A31-A2, and HLDA only change at the beginning of 
phase one. D31-D0 (write cycles) only change at the 
beginning of phase two. The READY, HOLD, BUSY, 
ERROR, PEREQ, FLT, and D31-D0 (read cycles) 
inputs are sampled at the beginning ot phase one. The 
NA, BS16, INTR, and NMI inputs are sampled at the 
beginning of phase two. 


Tx 
62 


2V 


Max 


Z) 


(A) 


-Oqe | 


Output n 


© ©) 


pee gemere (teen 
YS yas \N 
pee ESS 


© (9) 


3.0V pit I 
fs y Veni VES 
yee SS 


18421A-061 


Figure 58. Drive Levels and Measurement Points 
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Am386DE CPU Output re | 
Cc 


L 


C,, includes all parasitic capacitances. 


Figure 59. AC Test Load 18421A-062 


CLK2 


18421A-063 


Figure 60. CLK2 Timing 
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Hoo [ XY CURSES 
o-oo f yx URS 
t29 
na [ WY GEE BEE SS 
17 t18 
127 t28 
INTR, NMI [ NS GEE EEE. SW 


Figure 61. Input Setup and Hold Timing ees 


92 o1 Tx 62 01 


CLK2 [ 


BE3-BEO, 
LOCK 


W/R, MAO, 
D/C, ADS 


A31-A2 [ 


HLDA [ 


18421A-065 


Figure 62. Output Valid Delay Timing 
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18421A-066 
Figure 63. Write Data Valld Delay Timing (33 MHz) 
11 
61 $2 
18421A-067 
Figure 64. Write Data Hold Timing (33 MHz) 
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nom +6 


nom +3 


Output Valid Delay (ns) 


nom ~3 


nom ~6 


50 75 100 125 150 
C, (picofarads) 


Note: 
This graph will not be linear outside of the C, range shown. 
Figure 65. Typical Output Valid Delay Versus Load Capacitance 
at Maximum Operating Temperature (C, =120 pF) 


nom +9 


nom +6 


nom +3 


Output Valid Delay (ns) 


nom -3 


nom -6 
75 100 125 150 
C, (picofarads) 


Note: 
This graph will not be linear outside of the C, range shown. 


Figure 66. Typical Output Valid Delay Versus Load Capacitance 
at Maximum Operating Temperature (C, = 75 pF) 
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nom +9 
nom +6 
Output Valid Delay (ns) 
nom +3 
nom 
nom -3 
50 75 100 125 150 
C, (picofarads) 
Note: 
This graph will not be linear outside of the C, range shown. 18421A-070 


Figure 67. Typical Output Valid Delay Versus Load Capacitance 
at Maximum Operating Temperature (C, = 50 pF) 


Rise Time (ns) 0.8 V- 2.0 V 4 


50 75 100 125 150 
C, (picofarads) 


Note: 
This graph will not be linear outside of the C, range shown. 18421A-071 


Figure 68. Typical Output Rise Time Versus Load Capacitance 
at Maximum Operating Temperature (C, = 50 pF) 
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62 o1 o1 92 
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aes aa is Pin [| Max 
Tock [ (recent (eke ee 
acts aa is fin || Max 
oa [ So 
ne ce Min |_| Max 
narra [ = — 
ne Py Max eT Min | Max 
sy Ad 
D31—-Do ( _ (High Z) |— — 
113—Also applies to data float when write 
cycle is followed by read or idle 
t14 114 t14 tl4f 
[Mic "| Max [Twin | Max 
col Ray FR 


18421A-072 
Figure 69. Output Float Delay and HLDA Valid Delay Timing 


Initialization Sequence 


o2o0ro1 


92 91 


Note: 
The second internal processor phase following RESET High-to-Low transition (provided t25 and t26 are met) is 02. 


18421A-073 
Figure 70. RESET Setup and Hold Timing and Internal Phase 
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INSTRUCTION SET 


This section describes the AmM386DE microprocessor 
instruction set. A table lists all instructions along with 
instruction encoding diagrams and clock counts. Fur- 
ther details of the instruction encoding are then provided 
in the following sections, which completely describe the 
encoding structure and the definition of all fields occur- 
ring within Am386DE CPU instructions. 


Am386DE Microprocessor Instruction 
Encoding and Clock Count Summary 


To calculate elapsed time for an instruction, muitiply the 
instruction clock count, as listed in Table 24, by the pro- 
cessor clock period (e.g., 30 ns for a33-MHz Am386DE 
microprocessor). 


For more detailed information on the encodings of 
instructions refer to the Instruction Encodings sectionon 
page 112. This section explains the general structure of 
instruction encodings and defines exactly the encodings 
of all fields contained within the instruction. 


instruction Ciock Count Assumptions 


1. The instruction has been prefetched and decoded, 
and is ready for execution. 


2. Bus cycles do not require wait states. 


3. There are no local bus HOLD requests delaying 
processor access to the bus. 


4. No Exceptions are detected during instruction 
execution. 


5. If an effective address is calculated, it does not use 
two general register components. One register, 
scaling, and displacement can be used within the 
clock counts shown. However, if the effective 
address calculation uses two general register 
components, add 1 clock to the clock count shown. 


Instruction Clock Count Notation 


1. If two clock counts are given, the smaller refers toa 
register operand and the larger refers to a memory 
operand. 


2. n= number of times repeated. 


3. m= number of components in the next instruction 
executed, where the entire displacement (if any) 
counts as one component; the entire immediate data 
{if any) counts as one component; and each of the 
other bytes of the instruction and prefix(es) each 
count as one component. 
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Table 24. Am386DE Microprocessor Instruction Set Summary 


GENERAL DATA TRANSFER 
MOV = Move 


Register to Register/Memory 
Register/Memory to Register 

Immediate to Register/Memory immediate data 
Immediate to Register (short form) immediate dala 

Memory to Accumulator (shori form) full displacement 

Accumulator to Memory (shon form) full dispiacement 


Register/Memory lo Segment Register 
Segment Register to Register/Memory | 10001100 


MOVSX = Move with Sign Extension 


Ragister from Register/Memory 00001111 10111110 


MOVZX = Move with Zero Extension 

Register from Ragister/Memory Bie modreg = r/m 
PUSH = Push 

Ragister/Memory 

Register (short form) 01010 reg 


Segment Register (ES,CS,SS, or DS) | 000sreg2110 


Segmeni Register (FS or GS) 
Immediate immediate data 
PUSHA = Push All 

POP = Pop 

Register/Memory 
Register (shorl form) 

Segment Register (ES, SS, or DS) 

Segment Register (FS or GS) 
POPA = Pop All 

XCHG » Exchange 

Register/Memory with Register 
Register with Accumulator (short form) 

IN = Input from: 

Variable Port 


OUT = Output to: 


“IfCPLSIOPL *’ IECPL > IOPL 
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Table 24. AmM386DE Microprocessor Instruction Set Summary (continued) 


Instruction 


SEGMENT CONTROL 
LDS 2 Load pointer to DS 
LES = Load pointer to ES 
LFS = Load pointer to FS 
LGS = Load pointer to GS 
LSS = Load pointer to SS 
FLAG CONTROL 

CLC = Ciear Carry Flag 


CLD = Clear Direction Flag 


CLI = Claar Interrupt Enable Flag 
CLTS = Clear Task Switched Flag 


CMC = Complement Carry Flag 
LAHF = Load AH into Fiag 
POPF = Pop Flag 

PUSHF = Push Fiag 

SAHF = Store AH into Flag 
STC = Set Carry Flag 

STO = Set Direction Flag 

STI = Sal Interrupt Enable Flag 
ARITHMETIC 

ADD = Add 

Register to Register 

Register to Memory 

Mamory to Register 


Immediate to Register/Memory 


Immediate to Accumulator (short form) 


ADC = Add with carry 
Ragister to Register 
Register to Memory 
Mamory to Register 


Immediate to Register/Memory 


Immediate to Accumulator (shor form) 


INC = Increment 
Register/Memory 
Ragister (short farm) 
SUB = Subtract 
Register from Register 
Register from Memory 


Memory Irom Register 
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Format 


11000101 
11000100 
00001111 modreg t/m 


11111000 


11111100 


11111010 


00001111 00000110 


10011111 


10011100 


4111111014 


11111011 


000000dw 
ago000000w 
100000sw immediate dala 


og00010wW immediate data 


000100dw 
0001000w 
100000sw immediate data 


ediate dala 


Vd11111W mod 000 1/m 


01000 reg 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 


[Glock Count_[ Comments 
Real Protected Real |Protected 

Address | Address | Address] Address 

Mode Mode Mode | Mode 


ARITHMETIC (continued) 


immediate from Register/Memory 0010011W mod101 5m | immediate dala 
Immediate from Accumulator 0001110W immediate data 
(short form) 


SBB = Subtract with Borrow 


Register from Register 000110dw 
Register from Memory 0001100W 
Memory from Register 0001101W 
Immediate from Register/Memory 100000sw immediate data 


Immediate from Accumulator 0001110wW immediate data 


DEC = Decrement 


Register/Memory 1411111W 


Register (short form) 01001 reg 


CMP « Compare 


Ragister with Register 001110dw 
Memory with Register 0011100w 
Register with Memory 0011101w 
Immediate with Register/Memary 1O0000sw immediate data 


Immediate with Accumulatorishort form} 0011110w immediate data 


NEG = Change Sign 1411011wW 


AAA = ASCII Adjust tor Add 00110111 


OAA = Decimal Adjust for Add 

AAS = ASCil Adjuat for Subtract 00100111 

DAS = Decimal Adjust for Subtract | 00101111 

MUL 2 Multiply (Unsigned) 

Accumulator with Register/Memory 1111011W mod100 t/m 


Multiplier -Byte 12-17/15-20] 12-47/95-20 
-Word 12-25/15-28 ] 12-25/15-28 


-Doubleword 12-41/15-44 ] 12-41/15-44 
IMUL = Integer Muitiply (signed) 


Accumulator with Register/Memory 4111011W mod 101 tm 


Multiplier -Byle - 20] 12-17/15-20 


-Word 5 28] 12-25/15-2B 
-Doubleword z 4A 112-4 1/15-44 


Register with Register/Memory 00001111 | 10101111 


Multiplier -Byte - 20] 12-17/15-20 
-Word 2 12-25/15-28 
-Doubleword z -44112-41/15-44 

Register/Memory with Immediate 


to Register 011010s1 modreg fm immediate data 


-Word 13-26/14-27 
-Doubleword 13-42/14-43] 13-42/14-43 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 
[Clock Count_[ Comments _| 


instruction 
ARITHMETIC (continued) 
DIV = Divide (Unsigned) 


Accumulator by Register/Memory 
Divisor -Byte 
-Word 
-Doubleward 
IDIV = Integer Divide (Signed) 


Accumulator by Register/Memory 
Divisor -Byte 
-Word 
-Doubleword 
AAD = ASCII Adjust for Divide 


AAM = ASCIli Adjust for Multiply 


CBW = Convert Byte to Word 


CWD = Convert Word to 
Double Word 


LOGIC 
Shift/Rotate Instructions 


Format 


1111011wW mod110 rm 


mod111  t/m 


Not Through Carry (ROL, ROR, SAL, SAR, SHL, and SHR) 


Register/Memory by t 


Ragister/Memory by CL 


Register Memory by Immediate Count 


Through Carry (RCL and ACR) 
Register/Memory by 1 
Register/Memory by CL 


Alagister/Memory by Immediate Count 


SHLD = Shift Left Double 
Register/Memory by Immediate 
Register/Memory by CL 

SHRD = Shift Right Double 
Register/Memory by Immediate 
Register/Memory by CL 

AND = And 

Ragister to Register 

Register to Memory 

Memory to Regisiter 


Immediate to Register/Memary 


Register/Memary and Register 


Immediate Data and Register/Memory 


Immediate Data and Accumulator 
(short form) 


1101000w mod TTT = t/m 
1100000wW mod TTT «wm immediate 8-bit data 


dm 


1100000w immediate 8-bit data 
TT 


Instruction 
ROL 
ROR 
ACL 
RCR 

SHUSAL 
SHA 
SAR 


00001111 10100100 ee Pag ie: Petar Se tas 
00001111 10100101 


10101100 | modreg rm 
am 


001000dw vm 
0010000w mod reg vm 


001000iWw 


immediate 8-bt data 


immediate data 


modd00 f/m | immediate data 
1010100w | immediate data 


Am386DE Microprocessor 


Real | Protected 
Address | Address 
Mode Mode 


103 


ot amo PRELIMINARY 
Table 24. AmM386DE Microprocessor Instruction Set Summary (continued) 


LOGIC (continued) 
OR «Or 


Register to Register 

Memory to Register 

Immediate and Ragister/ Memory immediate dala 
Immediate to Accumutatar (short form) immediate data 


XOR = Exclusive or 


Register to Register 

Registro Memory 

Mamory to Register 

Immediate to Ragister/Memory immediate data 
Immediate to Accumulator (short form) immediate data 


STRING MANIPULATION 


CMPS = Compare Byta/Word | soroo11w | 10 
INS = Input Byte/Wd from DX Port gran 


LODS = Load Byte/Wd to AL/AX 1010110wW 5 


| sorostow | 
MOVS w Move Byte/Word 8 
OUTS « Output Byte/W4 to DX Port 8'/28"* 
SCAS w Scan Byte/Word 

STOS w= Store Byte/Word from ALAXX} 1010101W 


XLAT = Translate String 


REPEATED STRING MANIPULATION Repeated by Count in CX or ECX 
REPE CMPS = Compare string 


rovers) [Virion [rovworw 


REPNE CMPS = Compare String 


(Find Match) 
REP INS = Input String 114110010 
REP LODS s Load String 
REP MOVS = Move String eee | 
REP OUTS = Output String 


REPE SCAS = Scan String 


REPNE SCAS = Store String 


BIT MANIPULATION 


* HCPL < IOPL HCPL > IOPL 
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Table 24. AmM386DE Microprocessor Instruction Set Summary (continued) 


BIT MANIPULATION (continued) 
BT = Test Bit 


Register/Memory, Immediate 00001111 40111010 mod 10 0 r/m fimmeadiate 8-bit data 
Register/Memory, Register 00001111 10100011 modreg = r/m 


BTC = Test Bit and Complement 


Raegister/Memory, Immediate 00001111 10111010 mod111 /m Jimmediate 8-bil data 
Register/Memory, Register 00001111 10111011 modreg = t/m 


BTR = Test Bit and Reset 


Register/Memory, Immediate 00001111 p40111010 | mod 110 t/m Jimmediate 8-bit data 
Register/Memory, Ragister Ga Eee modreg = /m 


BTS = Test Bit and Set 


Register/Memary, immediate 00001111 10111010 mod10t f/m jimmediate B-bit dala 
Register/Memory, Register 00001111 ptororo14 | feg tim 


CONTROL TRANSFER 


CALL = Call 


Direct Within Segment 11101000 lull displacement 
Register/Memory 11191191 mod010 r/m 
Indirect Within Segment 


Direct Intersegment 10011010 unsigned full offset, selector 


Protected Mode Only (Direct intersegment) 
Via Call Gate to Same Privilege Level 
Via Call Gate to Different Privilege Level, (No Parameters) 
Via Cail Gate to Ditferant Privilege Level, (x Parameters) 
From 80286 Task to 80286 TSS 
From 80286 Task to Am386DE CPU TSS 
From Am386DE CPU Task to 80286 TSS 
From Am386DE CPU Task to Am386DE CPU TSS 


Indirect Intersagment V11909919 mod011 r/m 


Protected Mode Only (Indirect Intersegment) 


Via Call Gate to Sarne Privilege Level 

Via Call Gate to Ditterent Privilege Level {No Parameters} 
Via Call Gate to Diffarent Privilege | evel (x Parameters) 
From 80286 Task to 80286 TSS 

From 80286 Task to Am386DE CPU TSS 

From Am386DE CPU Task to 80286 TSS 

From Am386DE CPU Task to Am386DE CPU TSS 


JMP = Unconditional Jump 
Short 11101011 8-bil displacement 


Direct within Segment full displacement 


Register/Memory 1444tt44 
{Indirect within Segment 
Direct Intersegment 11101010 unsigned full offset, selector 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 


CONTROL TRANSFER (continued) 


Protected Mode Only (Diract Intersegment) 


Via Cali Gate to Same Privilege Level 

From 80286 Task to 80286 TSS 

From 80286 Task to AM386DE CPU TSS 

From Am3B6DE CPU Task to 80286 TSS 

From Am386DE CPU Task to Am386DE CPU TSS 


Indirect Intersegment 11411114 mod101 f/m 


Protectad Mode Only (Indirect Intersegmant) 


Via Call Gate to Same Privilege Level 

From 80286 Task to 80286 TSS 

From 80286 Task to Am386DE CPU TSS 

From Am386DE CPU Task to B0286 TSS 

From Am386DE CPU Task to Am386DE CPU TSS 


RET = Retum from CALL 


Within Segment 
Within Seg. Adding Immediate ta SP 
Intersegment 
Intersegment Adding Immediate to SP 


Protected Mode Only (RET) to Different Privilege Level 
Intersagment 
Intersagment Adding Immediate to SP 


CONDITIONAL JUMPS (Note: Times are Jump “Taken or Not Taken”) 


JO = Jump on Overflow 


8-bit Displacement 01110000 
Full Displacement 00001111 yaq000000 full displacement 


JNO = Jump on Not Overtiow 


8-bit Displacement 01110001 8-bit displacement 7+emorsd 
Full Displacement 00001111 10000001 full displacement 7+mor3 


JB/JNAE = Jump on Betow/Not Above or Equal 


bi lplacemon ai1to0n0 
Full Displacement 00001411 10000010 jull displacement 


JNB/JAE = Jump on Not Below/Above or Equal 


Full Displacement 00001111 10000011 tull displacement 


JE/JZ = Jump on Equal/ Zero 


8-bit Displacement 01110100 
| Displacement 00001111 10000100 full displacement 


E/JNZ = Jump on Not Equal/Not Zero 


014110101 B-bil displacement 
| Displacement 00001111 10000101 full displacement 


E/JNA = Jump on Below or Equal/Not Above 


86-bit Displacement 01110110 
Full Displacement 10000110 tutl displacement 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 


Real Protected 
. Address | Address 
Instruction Format Mode Mode 


CONDITIONAL JUMPS (continued) 
JNBE/JA = Jump on Not Below or Equal/Above 


8-bit Displacement 7+emorg 
Full Displacement full displacement 7+m or 3 
JS = Jump on Sign 

8-bil Displacement 

Full Displacement full displacement 


JNS = Jump on Not Sign 


B-bit Displacement 01111001 7+m orga 
Full Displacement 00001111 10001001 full displacement 7+m or 3 


JP/JPE = Jump on Parity/Parity Even 


Full Displacement 10001010 = | fulldisplacement 7+m of 3 


JNP/JPO = Jump on Not Parity/Parity Odd 


8-bit Displacement 01111011 7+mor3 


Full Displacement 00001111 10001011 fuli displacement 7+mor3 


JL/JNGE = Jump on Less/Not Greater or Equal 


8-bit Displacement 01111100 B-bit displacement 7+moar3 
Full Displacement 00001111 10001100 full displacement 7+mor3 


JNLJGE = Jump on Not Less/Greater or Equal 


8-bit Displacement 7+m or 3 
Full Displacement full displacement Temoar3 
JLE/JNG = Jump on Less or Equai/Not Greater 

it Displacement 74m or 3 
Displacement 10001110 fulldisplacement 7+m org 


JNLE/JG = Jump on Not Less or Equal/Greater 


8-bit Displacement 8-bil displacement 7+mor3 


Full Displacement aoo0o0t1i11 10001111 full displacement 7+m or3 


JCXZ = Jump on CX Zero* B-bit displacement 9+mor5 
JECXZ = Jump on ECX Zero* 8-bit displacement 9+m or § 
LOOP = Loop CX Times B-bit displacement 1am 


LOOPZ/LOOPE = Loop with Zero/Equal B-bit displacement 14m 
LOOPNZ/LOOPNE=1.cop white Not Zerc 8-bit displacement Wm 
CONDITIONAL BYTE SET (Note: Times are Register/Memory) 

SETO = Set Byte on Overtlow 


To Ragister/Memory 00001111 10010000 mod 000 tm 


SETNO = Sat Byte on Not Overflow 


To Register/Memory oo000111?4 10010001 mod000 tm 


SETB/SETNAE = Set Byte on Below/Not Above or Equal 


To Register/Memory 00001111 10010010 mod 


* Address Size Prefix Diflerentiates JCXZ from JECXZ 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 


CONDITIONAL BYTE SET (continued) 

SETNB = Set Byte on Not Below/Above or Equal 

To Register/Memory 00001111 

SETE/SETZ = Set Byte on Equal/Zero 

To Register/Memory 00001111 

SETNE/SETNZ = Set Byte on Not Equal/Not Zero 

To Register/Memory 00001111 10010101 
SETBE/SETNA = Set Byte on Beiow or Equal/Not Above 


To Register/Memary 00001111 10010110 


SETNBE/SETA = Set Byte on Not Below or Equal/Above 


To Register/Memory 00001111 10010111 


SETS = Set Byte on Sign 


To Register/Memory oo000t111 10011000 


SETNS = Set Byte on Not Sign 


To Register/Memory 00001111 10011001 


SETP/SETPE = Set Byte on Parity/Parity Even 


To Register/Memory 00001111 


SETNP/SETPO = Set Byte on Not Parity/Partty Odd 


To Register/Mamory 00001111 
SETLSETNGE « Set Byte on Less/Not Greater or Equal 


To Register/Memory 00001111 10011100 


SETNL/SETGE = Set Byte on Not Less/Greater or Equal 


To Register/Memory ooo0o1it! 


SETLE/SETNG =z Sat Byte on Loss or Equat/Not Greater 


To Ragisier/Memory 00001111 10011110 


SETNLE/SETG = Set Byte on Not Less or Equal/Greator 


To Register/Memory mod000 rm 


ENTER « Entor Procedure 16-bit displacement, 8-bit level 
L=9 
Lat 
L>1 


LEAVE = Leave Procedure 


INTERRUPT INSTRUCTIONS 
INT = Interrupt: 


INTO = Interrupt 4 if Overflow Flag Set] 11001110 
HOF = 1 


If OF =O 
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10 
12 
15+4(n-1) 


4 


4S 


10 
12 
1544(n-1) 


4 
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Tabie 24. Am386DE Microprocessor Instruction Set Summary (continued) 


INTERRUPT INSTRUCTIONS (continued) 


Bound = Interrupt 5 If Detect 01100010 
Value Out of Range 


If Out of Range 
Ifin Range 


Protected Mode Only (INT) 
INT: Type Specified 
Via Interrupt or Trap Gate to Same Privilege Level 
Via Interrupt or Trap Gale to Ditferent Privilege Level 
From 80286 Task to 80286 TSS via Task Gate 
From 80286 Task 10 AM386DE CPU TSS via Task Gate 
From Am386DE CPU Task to 80286 TSS via Task Gate 
From Am3B6DE CPU Task to Am386DE CPU TSS via Task Gate 


INT: Type 3 

Via Interrupt or Trap Gate to Same Privilege Level 

Via Interrupt or Trap Gate to Different Privilege Level 

From 80286 Task to 80286 TSS via Task Gale 

From 80286 Task to AM386DE CPU TSS via Task Gale 

From Am386DE CPU Task to 80286 TSS via Task Gate 

From Am386DE CPU Task to Am386DE CPU TSS via Task Gate 


INTO 
Via Interrupt or Trap Gate to Same Privilege Level 
Via Interrupt or Trap Gate to Ditlerent Privilege Level 
From B0286 Task to 80286 TSS via Task Gate 
From 80286 Task to AmM386DE CPU TSS via Task Gate 
From Am386DE CPU Task to 80286 TSS via Task Gate 
From Am386DE CPU Task to Am386DE CPU TSS via Task Gate 


BOUND 

Via Interrupt or Trap Gate to Same Privilege Level 

Via Interrupt or Trap Gate to Different Privilage Level 

From 80286 Task to 80286 TSS via Task Gate 

From 80286 Task to AM386DE CPU TSS via Task Gate 

From Am386DE CPU Task to 80286 TSS via Task Gale 

From Am386DE CPU Task to Am386DE CPU TSS via Task Gate 


INTERRUPT RETURN 


IRET = Interrupt Return 11001111 


Protected Mode Only (RET) 
To the Same Privilege Level (within Task) 
To Different Privilege Level (within Task) 
From 80286 Task to 80286 TSS 
From 80286 Task ta AM386DE CPU TSS 
From Am386DE CPU Task to 80286 TSS 
From Am386DE CPU Task to Am386DE CPU TSS 
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Table 24. Am386DE Microprocessor Instruction Set Summary (continued) 
[Clock Count [Comments] 


Real 
Addr: 


PROCESSOR CONTROL 

WT WALT 

MOV = Move to and From Control/Debug/Test Registers 

CRO/CR2/CR3 from register 11/4/5 11/4/5 
Register FroCR3m -0 6 6 
OR3-0 From Register 22 
DR7-6 From Register 16 
Register trom DR7-6 14 
Register from DR3-0 22 
TRA?7-6 from Register 12 
Register trom TR7000011114 

NOP = No Operation 


WAIT = Wait until BUSY 
pin is negated 


NOP = No Operation 


PROCESSOR EXTENSION INSTRUCTIONS 


Processor Extension Escape V1011TTT modlL LL vm 


TTT and LLL bils are op-code information lor coprocessor 
PREFIX BYTES 
Address Size Profix 
LOCK = Bus Lock Prefix 11110000 


Operand Size Pretix 01100110 


Segment Override Proilx 
cs: 00101110 
OS: 
ES: 001001190 
FS: 01100100 
GS: 01100101 
SS: 00110110 

PROTECTION CONTROL 

ARPL « Adjust Requested Privilege Level 


From Register/Memory 

LAR = Load Access Rights 

From Ragister/Memory 

LGOT = Load Global Descriptor 

Table Register 

LIDT = Load interrupt Descriptor 

Table Register mod011 f/m 
LLDT = Load Local Descriptor 


Table Ragister to Register/Memoary 00001111 oa000000 


LMSW = Load Machine Status Word 


From Register/Memoary aoooit1t 00000001 
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Table 24. AM386DE Microprocessor instruction Set Summary (continued) 
[Glock Count [Comments] 


Protected Protected 
Reel Vietuat Real Virtual 
Address | Addrees [ Address] Addrees 
Mode Mode Mode Mode 


Instruction 
PROTECTION CONTROL (continued) 


LSL = Load Segment Limit 


From Ragister/Memory 


Byte-Granular Limit 
Page-Granular Limit 
LTR = Load Task Register 


From Register/Memory 

SGDT = Store Global Descriptor 
Table Register 

SIDT = Store Interrupt Descriptor 
Table Register 

SLDT = Store Local Descriptor Table Register 
To Ragister/Memory 
SMSW = Store Machine Status Word] 00001111 
STR = Store Task Register 
To Register/Memory 

VERR = Verify Read Access 
Register/Memory 

VERW = Vority Write Access 


Instruction Notes for Table 23. 


Notes a through c apply to Am386DE CPU Real Address 
Mode only. 


a. This is a Protected Mode instruction. Attempted execution 
in Real Mode will result in Exception 6 (Invalid op-code). 


b. Exception 13 fauk (General Protection) will occur in Real 
Modée if an operand reference is made that partially or fully 
extends beyond the maximum CS, DS, ES, FS, or GS limit, 
FFFFH. Exception 12 (fault stack segment limit violation or 
not present) will occur in Real Mode if an operand reference 
is made that partially or fully extends beyond the maximum 
SS limit. 


c. This instruction may be executed in Real Mode. In Real 
Mode, its purpose is primarily to initialize the CPU for 
Protected Mode. 

Notes d through g apply to Am386DE CPU Real Address 

Mode and Am386DE CPU Protected Address Mode. 


d, The Am386DE CPU uses an early-out multiply algorithm. 
The actual number of clocks depends on the position of the 
most significant bit in the operand (muttiplier). Clock counts 
given are minimum to maximum. To calculate actual clocks, 
use the following formula: Actual Clock = if m < > 0 then 
max (flog, /m|], 3} + b clocks: if m = 0 then 3 + b clocks. 
In this formula, m is the muttiplier, and 
b = 9 for register to register, 

b = 12 for memory to register, 
6 = 10 for register with immediate to register, 
b = 11 for memory with immediate to register. 


An Exception may occur, depending on the value of the 
operand. 


2 


00000011 


reg ifm 


00000000 moddo1 t/m 
00000001 modd0d0 f/m 
00000001 mod 001 f/m 


00001111 00000000 mod000 rm 


00001111 a0000000 mod001 «t/m 


fooo0r111 | oooc00c0 mod100 1/m 
00001111 |o00000000 


100 tim 


f LOCK is automatically asserted, regardless of the 
presence or absence of the LOCK prefix. 

g. COCK is asserted during descriptor table accesses. 

Notes h through r apply to AmM386DE CPU Protected 

Address Mode only. 

h. Exception 13 fault (General Protection Violation) will occur 
if the memory operand in CS, DS, ES, FS, or GS cannot be 
used due to either a segment limit violation or access rights 
violation. If a stack limit is violated, an Exception 12 (Stack 
Segment Limit Violation or Not Present) occurs. 


i. Forsegment load operations, the CPL, RPL, and DPL must 
agree with the privilege rules to avoid an Exception 13 fault 
(General Protection Violation). The segment’s descriptor 
must indicate present or Exception 11 (CS, DS, ES, FS, GS 
Not Present). if the SS register is loaded and a stack 
segment not present is detected, an Exception 12 (Stack 
Segment Limit Violation or Not Present) occurs. 


j. All segment descriptor accesses in the GDT or LDT made 
by this instruction will automatically assert LOCK to 
maintain descriptor integrity in multiprocessor systems. 

k. JMP, CALL, INT, RET, and IRET instructions referring to 
another code segment will cause an Exception 13 (General 
Protection Violation) if an applicable privilege rule is 
violated. 

1 An Exception 13 fault occurs if CPL is greater than 0 (Gis the 
most privileged level). 

m.An Exception 13 fault occurs if CPL is greater than (OPL. 

n. The IF bit of the flag register is not updated if CPL is greater 


than IOPL. The IOPL field of the flag register is updated only 
if CPL = 0. 
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o. The PE bit of the MSW (CRO) cannot be reset by this 
instruction. Use MOV into CRO if desiring to reset the PE bit. 

p. Any violation of privilege rules as applied to the selector 
operand does not cause a protection Exception; rather, the 
zero flag is cleared. 

q. If the coprocessor’s memory operand violates a segment 
limit or segment access rights, an Exception 13 fault 
(General Protection Exception) will occur before the ESC 
instruction is executed. An Exception 12 fault (Stack 
Segment Limit Violation or Not Present) will occur if the 
stack limit is violated by the operand's starting address. 

r. The destination of a JMP. CALL, INT, RET, or RET must be 
in the defined fimit of a code segment or an Exception 13 
fault (General Protection Violation) will occur. 


Instruction Encoding 
Overview 


All instruction encodings are subsets of the general 
instruction format shown in Figure 71. Instructions con- 
sist of one or two primary op-code bytes, possibly an 
address specifier consisting of the mod r/m byte and 
scaled index byte, a displacement if required, and an 
immediate data field if required. 


Within the primary op-code or op-codes, smaller encod- 
ing fields may be defined. These fields vary according to 
the class of operation. The fields define such informa- 
tion as direction of the operation, size of the displace- 
ments, register encoding, or sign extension. 


Almost all instructions referring to an operand in 
memory have an addressing mode byte following the 
primary op-code byte(s). This byte, the mod r/m byte, 
specifies the address mode to be used. Certain 
encodings of the mod r/m byte indicate a second 
addressing byte, the scale-index-base byte, follows the 
mod r/m byte to fully specify the addressing mode. 


Addressing modes can include a displacement immedi- 
ately following the mod r/m byte, or scaled index byte. If 
a displacement is present, the possible sizes are 8, 16, 
or 32 bits. 


If the instruction specifies an immediate operand, the 
immediate operand follows any displacement bytes. 
The immediate operand, if specified, is always the last 
field of instruction. 


Figure 71 illustrates several of the fields that can appear 
in an instruction, such as the mod field and the r/m field, 
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but the Figure does not show all fields. Several smaller 
fields also appear in certain instructions, sometimes 
within the op-code bytes themselves. Table 25 is a com- 
plete list of all fields appearing in the AM386DE micro- 
processor instruction set. 


32-Bit Extensions of the instruction Set 

With the Am386DE microprocessor, the 8086/ 
80186/80286 instruction set is extended in two orthogo- 
nal directions: 32-bit forms of all 16-bit instructions are 
added to support the 32-bit data types, and 32-bit 
addressing modes are made available for all instruc- 
tions referencing memory. This orthogonal instruction 
set extension is accomplished having a Default (D) bit in 
the code segment descriptor, and by having 2 pretixes to 
the instruction set. 


Whether the instruction defaults to operations of 16 or 
32 bits depends on the setting of the D bit in the code 
segment descriptor, which gives the default length 
(either 32 or 16 bits) for both operands and effective 
addresses when executing that code segment. In the 
Real Address Mode, no code segment descriptors are 
used, but a D value of 0 is assumed internally by the 
Am386DE microprocessor when operating in this mode 
(for 16-bit default sizes compatible with the 
8086/80186/80286). 


Two prefixes, the Operand Size Prefix and the Effective 
Address Size Prefix, allow overriding individually the 
Default selection of operand size and effective address 
size. These prefixes may precede any op-code bytes 
and aftect only the instruction they precede. If neces- 
Sary, one or both of the prefixes may be placed before 
the op-code bytes. The presence of the Operand Size 
Prefix and the Effective Address Prefix will toggle the 
operand size or the effective address size, respectively, 
to the value opposite tram the Default setting. For exam- 
ple, if the default operand size is for 32-bit data opera- 
tions, then presence of the Operand Size Prefix toggles 
the instruction to 16-bit data operation. As another 
example, if the default effective address size is 16 bits, 
presence of the Effective Address Size prefix toggles 
the instruction to use 32-bit effective address 
computations. 
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TTTTTTTMTTT TTT TT] mod TTT cm d32|16| 8 |none data 32| 16 | 8 | none 


7 Oo 7 0765320 765320 
a ee 
opcode mod r/m s-i-b address immediate 
(one or two bytes) byte byte displacement data 
(T represents an opcode bit) uc —_—_——’”,_ (4,2, 1 bytes (4, 2, 1 bytes 
register and address or none) or none) 
mode specifier 

Figure 71. General Instruction Format 18421A-074 


Tabie 25. Flelds within Am386DE Microprocessor Instructions 


Field Name Number of Bits 


Specifies if Data is Byte or Full Size (Full Size is either 16 or 32 bits) 1 
Specifies Direction of Data Operation 1 
Specifies if an Immediate Data Field must be Sign-Extended 1 
reg General Register Specifier 3 
mod r/m Address Mode Specifier (Effective Address can be a General Register) 2 for mod; 


3 for r/m 


ss Scale Factor for Scaled Index Address Mode 

index General Register to be used as Index Register 

base General Register to be used as Base Register 

sreg2 Segment Register Specifier for CS, SS, DS, ES 

sreg3 Segment Register Specifier for CS, SS, DS, ES, FS, GS 

tttn For Condition Instructions, specifies a Condition Asserted or a Condition Negated 


Note: Table 23 shows encoding of individual instructions. 


These 32-bit extensions are available in all AM386DE 
microprocessor modes, including the Real Address 
Mode. In these modes the default is always 16 bits, so 
prefixes are needed to specify 32-bit operands or 
addresses. For instructions with more than one prefix, 
the order of prefixes is unimportant. 


Operand Size Operand Size 
During 16-Bit During 32-Bit 
Data Operations | Data Operations 


8 Bits 8 Bits 
16 Bits 32 Bits 
Unless specitied otherwise, instructions with 8- and 


16-bit operands do not affect the contents of the high- Encoding of General Register (reg) Field 


order bits of the extended registers. The general register is specified by the reg field, which 
Encoding of Instruction Fields may appear in the primary op-code bytes, or as the reg 

ra : : : ee : field of the mod r/m byte, or as the r/m field of the mod 
Within the instruction are several fields indicating regis- r/m byte. 


ter selection, addressing mode and so on. The exact 
encodings of these fields are defined immediately 
ahead. 


Encoding of Operand Length (w) Field 


For any given instruction performing a data operation, 
the instruction is executing as a 32- or 16-bit operation. 
Within the constraints of the operation size, the w field 
encodes the operand size as either one byte or the full 
operation size, as shown in the following table. 
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Encoding of reg Field When w Field 
is not Present In Instruction 


Register Selected | Register Selected 
During 16-Bit During 32-Bit 
teg Field | Data Operations | Data Operations 
AX 


Encoding of reg Field When w Field 
1s Present In Instruction 


Register Specified by reg Field 
During 16-Bit Data Operations 
Function of w Field 


Encoding of Segment Reg/ster (sreg) Fleld 

The sreg field in certain instructions is a 2-bit field allow- 
ing one of the four 80286 segment registers to be speci- 
tied. The sreg field in other instructions is a 3-bit field, 
allowing the AM386DE microprocessor FS and GS seg- 
ment registers to be specified. 
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2-Bit sreg2 Fleld 


2-Bit sreg2 Field Segment Register Selected 


00 ES 


01 cs 
10 ss 
11 DS 


3-Bit sreg3 Fleld 


3-Bit sreg3 Field Segment Register Selected 


ES 
cs 
Ss 


DS 

FS 

GS 
do not use 
do not use 


Encoding of Address Mode 


Except for special instructions, such as PUSH or POP, 
where the addressing mode is pre-determined, the 
addressing mode for the current instruction is specified 
by addressing bytes following the primary op-code. The 
primary addressing byte is the mod r/m byte, and a 
second byte of addressing information, the scale-index- 
base (s-i-b) byte, can be specified. 


The s-i-b byte is specified when using 32-bit addressing 
mode and the mod r/m byte has r/m = 100 and mod = 00, 
01, or 10. When the s-i-b byte is present, the 32-bit 
addressing mode is a function of the mod, ss, index, and 
base fields. 


The primary addressing byte, the mod r/m byte, also 
contains three bits (shown as TTT in Figure 83) some- 
times used as an extension of the primary op-code. The 
three bits, however, may also be used as a register field 
(reg). 


When calculating an effective address, either 16-bit 
addressing or 32-bit addressing is used. 16-bit addres- 
sing uses 16-bit address components to calculate the 
effective address while 32-bit addressing uses 32-bit 
address components to calculate the effective address. 
When 16-bit addressing is used, the mod r/m byte is 
interpreted as a 16-bit addressing mode specifier. When 
32-bit addressing is used, the mod r/m byte is inter- 
preted as a 32-bit addressing mode specifier. 


Tables on the following pages define all encodings of all 
16- and 32-bit addressing modes. 
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Encoding of 16-Bit Address Mode with mod r/m Byte 


Effective Address Effective Address 


DS{BX + SI] DS[BX + Si + d16] 
DS{BX + DI] DS [BX + DI + d16] 
SS{BP + SI] SS[BP + SI + d16] 
DS [BP + Dl] SS{BP + DI + d16] 
DS {SI DS{SI + d16] 
DS{DI} DS{[DI + d16] 
DS:d16 SS{BP + d16] 
DS [Bx] DS[BX + d16] 


DS:[BX + Sl + d8] Register— See Below 
DS:[BX + DI + d8} Register— See Below 
SS:[BP + SI + d8] Register— See Below 
SS[BP + DI + d8] Register— See Below 
DS[SI + d8} Register— See Below 
DS {DI + d8] Register— See Below 
SS:[BP + d8] Register— See Below 
DS:[BX + d8)} Register— See Below 


Register Specified by r/m Register Specified by r/m 
During 32-8it Data Operations During 16-Bit Data Operations 


Function of w Field Function of w Field 
modrim [~ (whenw=0) | (whenw=1) | |_modrim | (whenw=0) | (whenw=1) _| 
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Encoding of 32-Bit Address Mode with mod r/m byte 
(No s-l-b Present) 


DS:[EAX] DS:[EAX + d32] 
DS:{ECX] DS:{ECX + d32] 
DS:[EDX] DS:[EDX + d32] 
DS:[EBX] DS-:[EBX + d32] 
s-i-b is present s-i-b is present 
DS:d32 SS;EBP + d32] 
DS:{ESI] DS{ESI + d32] 
DS:{EDI} DS:[EDI + d32} 


DS:[EAX + d8] Register— See Below 
DS{ECX + d8] Register— See Below 
DS:EDX + d8] Register— See Below 
DS:[EBX + d8} Register— See Below 
s-i-b is present Register— See Below 
SS[EBP + 08] Register— See Below 
DS[ESI + d8] Register— See Below 


DS:[EDI + d8} Register— See Below 


Register Specified by reg or r/m Register Specified by reg or r/m 
During 32-Bit Data Operations During 16-Bit Data Operations 


Function of w Field Function of w Field 
rm modrim |" (whenw=0) | (whenw=1) _| 


AL 
CL 
DL 
BL 
AH 
CH 
DH 
BH 
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Encoding of 32-Bit Address Mode 
(mod r/m Byte and s-i-b present) 


Ettective Address 


OS[EAX + (scaled index)] 
OSECX + (scaled index)] 
DS{EDX + (scaled index)] 
DS[EBX + (scaled index)] 
SSESP + (scaled index)} 
DS:[d32 + (scaled index)] 
DS:[ESI + (scaled index)] 
DS{EDI + (scaled index)] 


DS:[EAX + (scaled index) + d8] 
DS:[ECX + (scaled index) + d8} 
OS:{EDX + (scaled index) + d8] 
DS:[EBX + (scaled index) + d8] 
SS:[ESP + (scaled index) + d8] 
SS{EBP + (scaled index) + d8] 
DS:[ESI + (scaled index) + d8} 

DS:[EDI + (scaled index) + d8] 


DS:[EAX + (scaled index) + d32] 
DS:[ECX + (scaled index) + d32] 
DS:[EDX + (scaled index) + d32] 
DS:[EBX + (scaled index) + d32] 
SS:[ESP + (scaled index) + d32} 
SSEBP + (scaled index) + d32] 
DS:[ESI + (scaled index) + d32] 

DS:[EDt + (scaled index) + d32] 


Note: 


Mod field in mod 1/m byte; ss, index, base fields in s-i-b byte. 


es ee 
0 x1 


01 
10 
11 


EAX 
ECX 
EDX 
EBX 
no index rag (see note) 
EBP 
ESI 
EDI 


Note: 

When index field is 100, indicating no index 
register, then ss field must equal 00. If index is 
100 and ss does not equal 00, the effective 
address is undefined. 


Encoding of Operation Direction (d) Field 


in many two-operand instructions the d field is present to 
indicate which operand is considered the source and 
which is the destination. 


| a | Direction of Operation 


Register/Memory €— Register 

reg Field indicates Source Operand; 
mod r/m or mod ss index base indicates 
Destination Operand. 


Register <— Register Memory 

reg Field indicates Destination Operand; 

mod r/m or mod ss index base indicates Source 
Operand. 


Encoding of Sign-Extend (s) Field 


The s field occurs primarily to instructions with immedi- 
ate data fields. The s field has an effect only if the size of 
the immediate data is 8 bits and is being placed in a 16- 
or 32-bit destination. 


Effect on Effect on 
Immediate Data 8 Immediate Data 16/32 
4 | Sign-Extended Data 8 to fill 
16-Bit or 32-Bit Destination 
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Encoding of Conditional Test (tttn) Fleld When Interpreted as Test Register Field 


TRE 
TR? 


For the conditional instructions (conditional jumps and 
set on condition), tttn is encoded with n indicating to use 
the condition (n=0) or its negation (n=1), and ttt giving 
the condition to test. 


Overflow 
No Overflow 


Below/Not Above or Equal 
Not Below/Above or Equal 
Equal/Zero 

Not Equal/Not Zero 

Below or Equal/Not Above 


Not Below or Equal/Above 

Sign 

Not Sign 

Parity/Parity Even 

Not Parity/Parity Odd 

Less Than/Not Greater or Equal 

Not Less Than/Greater or Equal 

Less Than or Equal/Not Greater Than 
Not Less Than or Equal/Greater Than 


Encoding of Control or Debug or Test Register 
(eee) Fleld 


For the loading and storing of the Control, Debug and 
Test registers. 


When Interpreted as Control Register Fleld 


000 CRO 
010 CR2 
011 CR3 


Do not use any other encoding. 


When Interpreted as Debug Register Field 


Do not use any other encoding. 
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Do not use any other encoding. 


ELECTRICAL DATA 
Introduction 


The following sections describe recommended electri- 
cal connections for the Am386DE microprocessor and 
its electrical specifications. 


Power and Grounding 
Power Connections 


The Am386DE CPU is implemented in CS21S technol- 
ogy and has modest power requirements. However, its 
high clock frequency and 72 output buffers (address, 
data, contro!, and HLDA) can cause power surges as 
muttiple output buffers drive new signal levels simulta- 
neously. For clean on-chip power distribution at high fre- 
quency, 20 Vcc and 21 Vss pins separately feed 
functional units of the Am386DE CPU. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Am386DE CPU. On 
the circuit board, all Vcc pins must be connected on a 
Vcc plane. All Vss pins must be likewise connected to 
the GND plane. 


Power Decoupling Recommendations 


Liberal decoupling capacitance should be placed near 
the Am386DE CPU. The Am386DE microprocessor 
driving its 32-bit parallel address and data buses at high 
frequencies can cause transient power surges, particu- 
larly when driving large capacitive loads. 


Low inductance capacitors and interconnects are rec- 
ommended for best high frequency electrical perfor- 
mance. Inductance can be reduced by shortening circuit 
board traces between the Am386DE microprocessor 
and decoupling capacitors as much as possible. 
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Resistor Recommendations 


The ERROR, FLT, and BUSY inputs have resistor pull- 
ups of approximately 20 Kohms built into the Am386DE 
CPU to keep these signals negated when no 387DX 
math coprocessor is present in the system (or tempo- 
rarily removed from its socket). The BS16 input aiso has 
an internal pull-up resistor of approximately 20 Kohms, 
and the PEREQ input has an internal pull-down resistor 
of approximately 20 Kohms. 


In typical designs, the external pull-up resistors are re- 
commended. However, a particular design may have 
reason to adjust the resistor values recommended here, 
or alter the use of pull-up resistors in other ways. 


Other Connection Recommendations 


For reliable operation, always connect unused inputs to 
an appropriate signal level. NC pins should always 
remain unconnected. 


Particularly when not using interrupts or bus hold, (as 
when first prototyping, perhaps) prevent any chance of 
spurious activity by connecting these associated inputs 


to GND. 
Pin Signal 
B7 INTR 
B8 NMI 
D14 HOLD 


If not using address pipelining, pull-up D13 NA to Vec. 
If not using 16-bit size, pull-up C14 BS16 to Voc. 
Pull-ups in the range of 20 Kohms are recommended. 
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oN amo PRELIMINARY 
PHYSICAL DIMENSIONS 


PQB 132 — Plastic Quad Flat Pack; Trimmed and Formed 
(all measurements are in inches) 
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Microsoft at Work is a trademark of Microsoft, Corp. 
Product names used in this publication are for identification purposes only and may be trademarks of their respective companies. 
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